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Abstract A search for the Standard Model Higgs boson produced in association with a top-quark pair, ttH, is 
presented. The analysis uses 20.3 fb“^ ofpp collision data at ^/s = 8 TeV, collected with the ATLAS detector at the 
Large Hadron Collider during 2012. The search is designed for the H ^ bb decay mode and uses events containing 
one or two electrons or muons. In order to improve the sensitivity of the search, events are categorised according to 
their jet and 6 -tagged jet multiplicities. A neural network is used to discriminate between signal and background 
events, the latter being dominated by tt-Tjets production. In the single-lepton channel, variables calculated using 
a matrix element method are included as inputs to the neural network to improve discrimination of the irreducible 
tt+bb background. No significant excess of events above the background expectation is found and an observed 
(expected) limit of 3.4 (2.2) times the Standard Model cross section is obtained at 95% confidence level. The ratio 
of the measured ttH signal cross section to the Standard Model expectation is found to be p = 1.5±1.1 assuming 
a Higgs boson mass of 125 GeV. 


1 Introduction 


The discovery of a new particle in the search for the 
Standard Model (SM) Higgs boson [4j|^ at the 
LHC was reported by the ATLAS and CMS col¬ 
laborations in July 2012. There is by now clear evidence 
of this particle in the H —>■ 77 , H ZZ^*^ —>■ 4£, 
H WW^*^ Miy and H —>■ TT decay channels, 
at a mass of around 125 GeV, which have strength- 


sitive through loop effects. Since yt is expected to be 
close to unity, it is also argued to be the quantity that 


might give insight into the scale of new physics 20 . 

The H —>■ bb final state is the dominant decay mode 
in the SM for a Higgs boson with a mass of 125 GeV. So 
far, this decay mode has not yet been observed. While 
a search for this decay via the gluon fusion process is 
precluded by the overwhelming multijet background, 
Higgs boson production in association with a vector bo- 


ened the SM Higgs boson hypothesis pp] of the ob- son (VH) (2^ or a top-quark pair (tt) significantly 


servation. To determine all properties of the new bo¬ 
son experimentally, it is important to study it in as 
many production and decay modes as possible. In par¬ 
ticular, its coupling to heavy quarks is a strong focus 
of current experimental searches. The SM Higgs bo¬ 
son production in association with a top-quark pair 
{ttH) p]p] with subsequent Higgs decay into bot¬ 
tom quarks {H —>■ bb) addresses heavy-quark couplings 
in both production and decay. Due to the large mea¬ 
sured mass of the top quark, the Yukawa coupling of 
the top quark {yt) is much stronger than that of other 
quarks. The observation of the ttH production mode 
would allow for a direct measurement of this coupling, 
to which other Higgs production modes are only sen- 


improves the signal-to-background ratio for this decay. 

This paper describes a search for the SM Higgs bo¬ 
son in the ttH production mode and is designed to be 
primarily sensitive to the H ^ bb decay, although other 
Higgs boson decay modes are also treated as signal. 
Figs.[^a) and[^b) show two examples of tree-level di¬ 
agrams for ttH production with a subsequent H ^ bb 
decay. A search for the associated production of the 
Higgs boson with a top-quark pair using several Higgs 
decay modes (including H —> bb) has recently been pub¬ 


lished by the CMS Collaboration 24 quoting a ratio 


of the measured ttH signal cross section to the SM 
expectation for a Higgs boson mass of 125.6 GeV of 

p = 2.8 ± 1 . 0 . 
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Fig. 1 Representative tree-level Feynman diagrams for the production of the Higgs boson in association with a top-quark pair 
{ttH) and the subsequent decay of the Higgs to bb, (a) and (b), and for the main background tt+bb (c). 


The main source of background to this search comes 
from top-quark pairs produced in association with ad¬ 
ditional jets. The dominant source is tt + bb produc¬ 
tion, resulting in the same final-state signature as the 
signal. An example is shown in Fig. [^c). A second con¬ 
tribution arises from ti production in association with 
light-quark (u, d, s) or gluon jets, referred to as ft-Flight 
background, and from tt production in association with 
c-quarks, referred to as ti+cc. The size of the second 
contribution depends on the misidentification rate of 
the algorithm used to identify &-quark jets. 

The search presented in this paper uses 20.3 fb“^ 
of data collected with the ATLAS detector in pp col¬ 
lisions at y/s = 8 TeV during 2012. The analysis fo¬ 
cuses on final states containing one or two electrons 
or muons from the decay of the tt system, referred to 
as the single-lepton and dilepton channels, respectively. 
Selected events are classified into exclusive categories, 
referred to as “regions”, according to the number of 
reconstructed jets and jets identified as &-quark jets 
by the 6 -tagging algorithm ( 6 -tagged jets or 6 -jets for 
short). Neural networks (NN) are employed in the re¬ 
gions with a significant expected contribution from the 
tiH signal to separate it from the background. Sim¬ 
pler kinematic variables are used in regions that are 
depleted of the tiH signal, and primarily serve to con¬ 
strain uncertainties on the background prediction. A 
combined fit to signal-rich and signal-depleted regions 
is performed to search for the signal while simultane¬ 
ously obtaining a background prediction. 


2 ATLAS detector 


The ATLAS detector 25 consists of four main sub¬ 


systems: an inner tracking system, electromagnetic and 
hadronic calorimeters, and a muon spectrometer. The 
inner detector provides tracking information from pixel 


and silicon microstrip detectors in the pseudorapidit 30 
range |? 7 | < 2.5 and from a straw-tube transition radi¬ 
ation tracker covering \r]\ < 2.0, all immersed in a 2T 
magnetic field provided by a superconducting solenoid. 
The electromagnetic sampling calorimeter uses lead and 
liquid-argon (LAr) and is divided into barrel (Ir^l < 
1.475) and end-cap regions (1.375 < \ri\ < 3.2). Hadron 
calorimetry employs the sampling technique, with ei¬ 
ther scintillator tiles or liquid argon as active media, 
and with steel, copper, or tungsten as absorber mate¬ 
rial. The calorimeters cover |? 7 | < 4.9. The muon spec¬ 
trometer measures muon tracks within It^I < 2.7 us¬ 
ing multiple layers of high-precision tracking chambers 
located in a toroidal field of approximately 0.5 T and 
1T in the central and end-cap regions of ATLAS, re¬ 
spectively. The muon spectrometer is also instrumented 
with separate trigger chambers covering \r]\ < 2.4. 


3 Object reconstruction 

The main physics objects considered in this search are 
electrons, muons, jets and 6 -jets. Whenever possible, 
the same object reconstruction is used in both the single¬ 
lepton and dilepton channels, though some small differ¬ 
ences exist and are noted below. 

Electron candidates are reconstructed from en¬ 
ergy deposits (clusters) in the electromagnetic calorime¬ 
ter that are matched to a reconstructed track in the 
inner detector. To reduce the background from non¬ 
prompt electrons, i.e. from decays of hadrons (in par- 

^ ATLAS uses a right-handed coordinate system with its 
origin at the nominal interaction point (IP) in the centre of 
the detector and the ^-axis coinciding with the axis of the 
beam pipe. The 2 ;-axis points from the IP to the centre of 
the LHC ring, and the y-axis points upward. Cylindrical co¬ 
ordinates are used in the transverse plane, (p being the 

azimuthal angle around the beam pipe. The pseudorapidity 
is defined in terms of the polar angle 6 as rj = — lntan(6/2). 
Transverse momentum and energy are defined as pt = P sin 9 
and Et = Esin6, respectively. 
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ticular heavy flavour) produced in jets, electron candi¬ 
dates are required to be isolated. In the single-lepton 
channel, where such background is signihcant, an 77 - 
dependent isolation cut is made, based on the sum of 
transverse energies of cells around the direction of each 
candidate, in a cone of size AR — ^{A(j)Y + (Arj)'^ = 
0.2. This energy sum excludes cells associated with the 
electron and is corrected for leakage from the electron 
cluster itself. A further isolation cut is made on the 
scalar sum of the track px around the electron in a 
cone of size AR = 0.3 (referred to as The lon¬ 

gitudinal impact parameter of the electron track with 
respect to the selected event primary vertex defined in 
Section]^ zq, is required to be less than 2 mm. To in¬ 
crease efficiency in the dilepton channel, the electron 
selection is optimised by using an improved electron 


identification method based on a likelihood variable 27 


and the electron isolation. The ratio of to the 

Px of the electron is required to be less than 0 . 12 , i.e. 
pij?ne30/p^ < 0.12. The optimised selection improves the 
efficiency by roughly 7% per electron. 

Muon candidates are reconstructed from track seg¬ 
ments in the muon spectrometer, and matched with 
tracks found in the inner detector 28 . The final muon 
candidates are refitted using the complete track infor¬ 
mation from both detector systems, and are required to 
satisfy |? 7 | < 2.5. Additionally, muons are required to be 
separated by AR > 0.4 from any selected jet (see below 
for details on jet reconstruction and selection). Further¬ 
more, muons must satisfy a px-dependent track-based 
isolation requirement that has good performance under 
conditions with a high number of jets from other pp 
interactions within the same bunch crossing, known as 
“pileup”, or in boosted conhgurations where the muon 
is close to a jet: the track px scalar sum in a cone of 
variable size AR < 10 GeV /pt^ around the muon must 
be less than 5% of the muon px. The longitudinal im¬ 
pact parameter of the muon track with respect to the 
primary vertex, zqj is required to be less than 2 mm. 


Jets are reconstructed from calibrated clusters 25 


29 built from energy deposits in the calorimeters, using 


the anti-fct algorithm 30-32 with a radius parameter 
R — 0.4. Prior to jet finding, a local cluster calibration 
scheme 33 34 is applied to correct the cluster ener¬ 
gies for the effects of dead material, non-compensation 
and out-of-cluster leakage. The jets are calibrated us¬ 
ing energy- and 77 -dependent calibration factors, derived 
from simulations, to the mean energy of stable parti¬ 
cles inside the jets. Additional corrections to account 
for the difference between simulation and data are ap¬ 
plied 35 . After energy calibration, jets are required to 
have Px > 25 GeV and I 77 I < 2.5. To reduce the con¬ 
tamination from low-px jets due to pileup, the scalar 


sum of the px of tracks matched to the jet and origi¬ 
nating from the primary vertex must be at least 50% 
of the scalar sum of the px of all tracks matched to the 
jet. This is referred to as the jet vertex fraction. This 
criterion is only applied to jets with px < 50 GeV and 
\V\ < 2.4. 

During jet reconstruction, no distinction is made 
between identified electrons and jet candidates. There¬ 
fore, if any of the jets lie AR < 0.2 from a selected 
electron, the single closest jet is discarded in order to 
avoid double-counting of electrons as jets. After this, 
electrons which are AR < 0.4 from a jet are removed 
to further suppress background from non-isolated elec¬ 
trons. 

Jets are identified as originating from the hadro- 
nisation of a 6-quark via an algorithm 36 that uses 
multivariate techniques to combine information from 
the impact parameters of displaced tracks with topo¬ 
logical properties of secondary and tertiary decay ver¬ 
tices reconstructed within the jet. The working point 
used for this search corresponds to a 70% efficiency to 
tag a 6-quark jet, with a light-jet mistag rate of 1%, 
and a charm-jet mistag rate of 20%, as determined for 
6 -tagged jets with px > 20 GeV and I 77 I < 2.5 in sim¬ 
ulated ti events. Tagging efficiencies in simulation are 
corrected to match the results of the calibrations per¬ 
formed in data 37 . Studies in simulation show that 


these efficiencies do not depend on the number of jets. 


4 Event selection and classification 

For this search, only events collected using a single¬ 
electron or single-muon trigger under stable beam con¬ 
ditions and for which all detector subsystems were op¬ 
erational are considered. The corresponding integrated 
luminosity is 20.3 fb“^. Triggers with different px thresh¬ 
olds are combined in a logical OR in order to max¬ 
imise the overall efficiency. The px thresholds are 24 or 
60 GeV for electrons and 24 or 36 GeV for muons. The 
triggers with the lower px threshold include isolation 
requirements on the lepton candidate, resulting in inef¬ 
ficiency at high Px that is recovered by the triggers with 
higher px threshold. The triggers use selection criteria 
looser than the final reconstruction requirements. 

Events accepted by the trigger are required to have 
at least one reconstructed vertex with at least hve asso¬ 
ciated tracks, consistent with the beam collision region 
in the x-y plane. If more than one such vertex is found, 
the vertex candidate with the largest sum of squared 
transverse momenta of its associated tracks is taken as 
the hard-scatter primary vertex. 

In the single-lepton channel, events are required to 
have exactly one identihed electron or muon with px > 
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25 GeV and at least four jets, at least two of which are 
6-tagged. The selected lepton is required to match, with 
AR < 0.15, the lepton reconstructed by the trigger. 

In the dilepton channel, events are required to have 
exactly two leptons of opposite charge and at least two 
6-jets. The leading and subleading lepton must have 
Pt > 25 GeV and px > 15 GeV, respectively. Events in 
the single-lepton sample with additional leptons passing 
this selection are removed from the single-lepton sample 
to avoid statistical overlap between the channels. In the 
dilepton channel, events are categorised into ee, pp and 
ep samples. In the ep category, the scalar sum of the 
transverse energy of leptons and jets, is required to 
be above 130 GeV. In the ee and pp event categories, 
the invariant mass of the two leptons, mu, is required 
to be larger than 15 GeV in events with more than 
two 6-jets, to suppress contributions from the decay of 
hadronic resonances such as the J/'0 and T into a same- 
flavour lepton pair. In events with exactly two 6-jets, 
mu is required to be larger than 60 GeV due to poor 
agreement between data and prediction at lower mu- A 
further cut on mu is applied in the ee and pp categories 
to reject events close to the Z boson mass: \mu — mz\ > 

8 GeV. 

After all selection requirements, the samples are dom¬ 
inated by tt-Tjets background. In both channels, se¬ 
lected events are categorised into different regions. In 
the following, a given region with m jets of which n are 
6-jets are referred to as “(TOj,nb)”. The regions with a 
signal-to-background ratio S/B >1% and S/Vb > 0.3, 
where S and B denote the expected signal for a SM 
Higgs boson with mn = 125 GeV, and background, re¬ 
spectively, are referred to as “signal-rich regions”, as 
they provide most of the sensitivity to the signal. The 
remaining regions are referred to as “signal-depleted re¬ 
gions” . They are almost purely background-only regions 
and are used to constrain systematic uncertainties, thus 
improving the background prediction in the signal-rich 
regions. The regions are analysed separately and com¬ 
bined statistically to maximise the overall sensitivity. 
In the most sensitive regions, (> 6j, > 4b) in the single¬ 
lepton channel and (> 4j, > 4b) in the dilepton channel, 
H ^ bb decays are expected to constitute about 90% 
of the signal contribution as shown in Fig. of Ap¬ 
pendix [A] 


Figurej^a) shows the S/'/B and S/B ratios for the dif- 

ferent regions under consideration in the single-lepton 
channel based on the simulations described in Sect. [5] 
The expected proportions of different backgrounds in 
each region are shown in Fig. j^b). The same is shown 
in the dilepton channel in Figs. [^a) and ib). 


5 Background and signal modelling 


After the event selection described above, the main 
background in both the single-lepton and dilepton chan¬ 
nels is tt-Tjets production. In the single-lepton channel, 
additional background contributions come from single 
top quark production, followed by the production of 
a IV or Z boson in association with jets (IV/Z-l-jets), 
diboson {WW, WZ, ZZ) production, as well as the as¬ 
sociated production of a vector boson and a tt pair, 
tt + V (V = W,Z). Multijet events also contribute 
to the selected sample via the misidentification of a 
jet or a photon as an electron or the presence of a 
non-prompt electron or muon, referred to as “Lepton 
misID” background. The corresponding yield is esti¬ 
mated via a data-driven method known as the “ma¬ 


trix method” 38 . In the dilepton channel, backgrounds 


containing at least two prompt leptons other than tt-|-jets 
production arise from Z-l-jets, diboson, and IVt-channel 
single top quark production, as well as from the ttV 
processes. There are also several processes which may 
contain either non-prompt leptons that pass the lepton 
isolation requirements or jets misidentified as leptons. 
These processes include IV-l-jets, ti production with a 
single prompt lepton in the final state, and single top 
quark production in t- and s-channels. Their yield is 
estimated using simulation and cross-checked with a 
data-driven technique based on the selection of a same- 
sign lepton pair. In both channels, the contribution of 
the misidentified lepton background is negligible after 
requiring two 6-tagged jets. 

In the following, the simulation of each background 
and of the signal is described in detail. For all MG sam¬ 
ples, the top quark mass is taken to be mt = 172.5 GeV 
and the Higgs boson mass is taken to be mu = 125 GeV. 


5.1 tt+jets background 


In the single-lepton channel, a total of nine indepen¬ 
dent regions are considered: six signal-depleted regions. The tt-Tjets sample is generated using the Powheg- 
(4j, 2b), (4j, 3b), (4j, 4b), (5j, 2b), (5j, 3b), (> 6j, 2b), Box 2.0 NLO generator [^-41 with the CTIO parton 

and three signal-rich regions, (5j, > 4b), (> 6j, 3b) and distribution function (PDF) set 42 . It is interfaced to 

6j, > 4b). In the dilepton channel, a total of six in- Pythia 6.425 43 with the CTEQ6L1 PDF set 44 

dependent regions are considered. The signal-rich re- and the Perugia2011G 45 underlying-event tune. The 

gions are (> 4j, 3b) and (> 4j,> 4b), while the signal- sample is normalised to the top-1—1-2.0 46 theoretical 
depleted regions are (2j, 2b), (3j, 2b), (3j, 3b) and (> 4j, 2b):alculation performed at next-to-next-to-leading order 
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Fig. 2 Single-lepton channel: (a) S/\fB ratio for each of the regions assuming SM cross sections and branching fractions, and 
mu = 125 GeV. Each row shows the plots for a specific jet multiplicity (4, 5, >6), and the columns show the 6-jet multiplicity 
(2, 3, >4). Signal-rich regions are shaded in dark red, while the rest are shown in light blue. The S/B ratio for each region is 
also noted, (b) The fractional contributions of the various backgrounds to the total background prediction in each considered 
region. The ordering of the rows and columns is the same as in (a). 
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Fig. 3 Dilepton channel: (a) The S/\^B ratio for each of the regions assuming SM cross sections and branching fractions and 
rriH = 125 GeV. Each row shows the plots for a specific jet multiplicity (2, 3, >4), and the columns show the 6-jet multiplicity 
(2, 3, >4). Signal-rich regions are shaded in dark red, while the rest are shown in light blue. The S/B ratio for each region is 
also noted, (b) The fractional contributions of the various backgrounds to the total background prediction in each considered 
region. The ordering of the rows and columns is the same as in (a). 
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(NNLO) in QCD that includes resummation of next-to- 


next-to-leading logarithmic (NNLL) soft gluon terms 47 - 
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The tt+jets sample is generated inclusively, but events 
are categorised depending on the flavour of partons that 
are matched to particle jets that do not originate from 
the decay of the tt system. The matching procedure is 
done using the requirement of AR < 0.4. Particle jets 
are reconstructed by clustering stable particles exclud¬ 
ing muons and neutrinos using the anti-Zcj algorithm 
with a radius parameter R = 0.4, and are required to 
have pt > 15 GeV and \r]\ < 2.5. 

Events where at least one such particle jet is matched 
to a bottom-flavoured hadron are labelled as tt+bb events. 
Similarly, events which are not already categorised as 
tt+bb, and where at least one particle jet is matched to 
a charm-flavoured hadron, are labelled as tt+cc events. 
Only hadrons not associated with b and c quarks from 
top quark and W boson decays are considered. Events 
labelled as either tt+bb or tt+cc are generically referred 
to as tf-l-HE events (HE for “heavy flavour”). The re¬ 
maining events are labelled as ft-Tlight-jet events, in¬ 
cluding those with no additional jets. 

Since Powheg-|-Pythia only models tt+bb via the 
parton shower, an alternative tt-Tjets sample is gener¬ 
ated with the Madgraph5 1.5.11 LO generator 


52 


using the CTIO PDF set and interfaced to Pythia 
6.425 for showering and hadronisation. It includes tree- 
level diagrams with up to three extra partons (includ¬ 
ing b- and c-quarks) and uses settings similar to those 
in Ref. [^. To avoid double-counting of partonic con¬ 
figurations generated by both the matrix element cal¬ 
culation and the parton-shower evolution, a parton-jet 
matching scheme (“MLM matching”) is employed. 

Fully matched NLO predictions with massive 6-quarks 
have become available recently 
with OpenLoops framework 
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within the Sherpa 
referred to in the 


following as SherpaOL. The SherpaOL NLO sam¬ 
ple is generated following the four-flavour scheme using 
the Sherpa 2.0 pre-release and the CTIO PDF set. The 
renormalisation scale (/tr) is set to /tr = !!»=( tbb 
where is the transverse energy of parton i, and the 
factorisation and resummation scales are both set to 


(Ft,* + E^i)/2. 

For the purpose of comparisons between tt-|-jets event 
generators and the propagation of systematic uncer¬ 
tainties related to the modelling of tt-|-HF, as described 
in Sect. HIT] a finer categorisation of different topolo¬ 
gies in tt-|-HF is made. In particular, the following cate¬ 
gories are considered: if two particle jets are both matched 
to an extra 6-quark or extra c-quark each, the event is 
referred to as ti + 66 or ti + cc; if a single particle jet is 
matched to a single 6(c)-quark the event is referred to 


as tt+b {tt+c); if a single particle jet is matched to a 66 
or a cc pair, the event is referred to as tt+B or tt+C, 
respectively. 

Figure]^ shows the relative contributions of the dif¬ 
ferent tt+bb event categories to the total ti+bb cross 
section at generator level for the Powheg-|-Pythia, 
Madgraph-|-Pythia and SherpaOL samples. It demon¬ 
strates that Powheg-I-Pythia is able to reproduce 
reasonably well the tt-|-HF content of the Madgraph 
tt+iets sample, which includes a LO tt -|- 66 matrix el¬ 
ement calculation, as well as the NLO SherpaOL pre¬ 
diction. 



Fig. 4 Relative contributions of different categories of tt+bb 
events in Powheg-|-Pythia, Madgraph+Pythia and Sher¬ 
paOL samples. Labels “tt-|-MPI” and “tt-|-FSR” refer to 
events where heavy flavour is produced via multiparton in¬ 
teraction (MPI) or final state radiation (FSR), respectively. 
These contributions are not included in the SherpaOL cal¬ 
culation. An arrow indicates that the point is off-scale. Un¬ 
certainties are from the limited MC sample sizes. 


The relative distribution across categories is such 
that SherpaOL predicts a higher contribution of the 
ti+B category, as well as every category where the pro¬ 
duction of a second 66 pair is required. The modelling of 
the relevant kinematic variables in each category is in 
reasonable agreement between Powheg-|-Pythia and 
SherpaOL. Some differences are observed in the very 
low regions of the mass and px of the 66 pair, and in 
the Pt of the top quark and tt systems. 

The prediction from SherpaOL is expected to model 
the ti + 66 contribution more accurately than both 
Powheg-I-Pythia and Madgraph-|-Pythia. Thus, in 
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the analysis tt+bb events are reweighted from POWHEG+ 
Pythia to reproduce the NLO tt+bb prediction from 
SherpaOL for relative contributions of different cat¬ 
egories as well as their kinematics. The reweighting is 
done at generator level using several kinematic vari¬ 
ables such as the top quark px, tt system px, and 
px of the dijet system not coming from the top quark 
decay. In the absence of an NLO calculation of ti+cc 
production, the Madgraph-|-Pythia sample is used 
to evaluate systematic uncertainties on the tt+cc back¬ 
ground. 

Since achieving the best possible modelling of the 
tt-|-jets background is a key aspect of this analysis, a 
separate reweighting is applied to tf-|-light and ti+cc 
events in Powheg-|-Pythia based on the ratio of mea¬ 
sured differential cross sections at ^/s = 7 TeV in data 
and simulation as a function of top quark px and ti 
It was verified using the simulation 
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system px 

that the ratio derived at a/s = 7 TeV is applicable 
to -/s = 8 TeV simulation. It is not applied to the 
ti+bb component since that component was corrected 
to match the best available theory calculation. More¬ 
over, the measured differential cross section is not sen¬ 
sitive to this component. The reweighting significantly 
improves the agreement between simulation and data 
in the total number of jets (primarily due to the ti sys¬ 
tem Px reweighting) and jet px (primarily due to the 
top quark px reweighting). This can be seen in Fig. 
where the number of jets and the scalar sum of the jet 
Px (iLx'^'^) distributions in the exclusive 2-6-tag region 
are plotted in the single-lepton channel before and after 
the reweighting is applied. 


5.2 Other backgrounds 

The W/Z+]Qts background is estimated from simula¬ 
tion reweighted to account for the difference in the WjZ 
Px spectrum between data and simulation [^. The 
heavy-flavour fraction of these simulated backgrounds, 
i.e. the sum of WjZ+th and WjZ+cc processes, is ad¬ 
justed to reproduce the relative rates of Z events with 
no 6-tags and those with one 6-tag observed in data. 
Samples of W/Z+]eXs events, and diboson production 
in association with jets, are generated using the Alp- 
GEN 2.14 59 leading-order (LO) generator and the 


CTEQ6L1 PDF set. Parton showers and fragmenta¬ 
tion are modelled with Pythia 6.425 for W/Z+]ets 
production and with Herwig 6.520 for diboson 
production. The kF-|-jets samples are generated with 
up to five additional partons, separately for IT-|-light- 
jets, Wbb+]eis, VFcc-|-jets, and VFc-|-jets. Similarly, the 
Z-fjets background is generated with up to five addi¬ 
tional partons separated in different parton flavours. 


Both are normalised to the respective inclusive NNLO 
theoretical cross section 


61 


. The overlap between WQQ 
{ZQQ){Q = b,c) events generated from the matrix el¬ 
ement calculation and those from parton-shower evo¬ 
lution in the VF-I-light-jet (Z-l-light-jet) samples is re¬ 
moved by an algorithm based on the angular separation 
between the extra heavy quarks: if AR{Q, Q) > 0.4, the 
matrix element prediction is used, otherwise the parton 
shower prediction is used. 

The diboson-l-jets samples are generated with up to 
three additional partons and are normalised to their 
respecitve NLO theoretical cross sections 


62 


Samples of single top quark backgrounds are gen¬ 
erated with Powheg-Box 2.0 using the CTIO PDF 
set. The samples are interfaced to Pythia 6.425 with 
the CTEQ6L1 set of parton distribution functions and 
Perugia2011C underlying-event tune. Overlaps between 
the ti and Wt final states are removed 63 . The sin¬ 


gle top quark samples are normalised to the approxi¬ 


mate NNLO theoretical cross sections 64 ■ 66 using the 
MSTW2008 NNLO PDF set 


67 68 


Samples of ti + V are generated with Madgraph 
5 and the CTEQ6L1 PDF set. Pythia 6.425 with the 
AUET2B tune 69 is used for showering. The tiV sam¬ 


ples are normalised to the NLO cross-section predic¬ 
tions |70l|71 


5.3 Signal model 

The tiH signal process is modelled using NLO ma¬ 
trix elements obtained from the HELAC-Oneloop pack- 
Powheg-Box serves as an interface to shower 


72 


age 

Monte Carlo programs. The samples created using this 


approach are referred to as PowHel samples 73 . They 


are inclusive in Higgs boson decays and are produced 
using the CTIOnlo PDF set and factorisation (/ip) and 
renormalisation scales set to /ip = MR = 

The PowHel tiH sample is showered with Pythia 


8.1 |74] with the CTEQ6L1 PDF and the AU2 underlying- 
The tiH cross section and Higgs boson 


75 


event tune 

decay branching fractions are taken from (N)NLO the¬ 
oretical calculations 


19 76 -82 , collected in Ref. 


In Appendix the relative contributions of the Higgs 
boson decay modes are shown for all regions considered 
in the analysis. 


5.4 Common treatment of MC samples 

All samples using Herwig are also interfaced to Jimmy 


4.31 84 to simulate the underlying event. All simulated 

to simulate photon 


samples utilise Photos 2.15 
radiation and Tauola 1.20 


85 


86 to simulate r decays. 
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Njets 


(a) 


xIO^ 



(b) 




Fig. 5 The exclusive 2-b-tag region of the single-lepton channel before and after the reweighting of the pt of the tt system 
and the pt of the top quark of the Powheg-|-Pythia ti sample. The jet multiplicity distribution (a) before and (b) after the 
reweighting; distributions (c) before and (d) after the reweighting. 


Events from minimum-bias interactions are simulated 
with the Pythia 8.1 generator with the MSTW2008 


LO PDF set and the AUET2 87 tune. They are su¬ 


perimposed on the simulated MC events, matching the 
luminosity profile of the recorded data. The contribu¬ 
tions from these pileup interactions are simulated both 
within the same bunch crossing as the hard-scattering 
process and in neighbouring bunch crossings. 

Finally, all simulated MC samples are processed thr¬ 


ough a simulation 88 of the detector geometry and 


response either using Geant4 89 , or through a fast 


simulation of the calorimeter response 90 . All simu¬ 


lated MC samples are processed through the same re¬ 
construction software as the data. Simulated MC events 
are corrected so that the object identification efficien¬ 


cies, energy scales and energy resolutions match those 
determined from data control samples. 

Figures [^a) and [^b) show a comparison of pre¬ 
dicted yields to data prior to the fit described in Sect.[^ 
in all analysis regions in the single-lepton and dilepton 
channel, respectively. The data agree with the SM ex¬ 
pectation within the uncertainties of 10-30 %. Detailed 
tables of the event yields prior to the fit and the cor¬ 
responding S/B and SjsfB ratios for the single-lepton 
and dilepton channels can be found in Appendix |B| 

When requiring high jet and 6-tag multiplicity in the 
analysis, the number of available MC events is signifi¬ 
cantly reduced, leading to large fluctuations in the re¬ 
sulting distributions for certain samples. This can neg¬ 
atively affect the sensitivity of the analysis through the 
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Fig. 6 Comparison of prediction to data in all analysis regions before the fit to data in (a) the single-lepton channel and (b) 
the dilepton channel. The signal, normalised to the SM prediction, is shown both as a filled red area stacked on the backgrounds 
and separately as a dashed red line. The hashed area corresponds to the total uncertainty on the yields. 


large statistical uncertainties on the templates and un¬ 
reliable systematic uncertainties due to shape fluctua¬ 
tions. In order to mitigate this problem, instead of tag¬ 
ging the jets by applying the ^-tagging algorithm, their 
probabilities to be 6-tagged are parameterised as func¬ 
tions of jet flavour, px, and t]. This allows all events in 
the sample before ^-tagging is applied to be used in pre¬ 


dicting the normalisation and shape after 6-tagging 91 


The tagging probabilities are derived using an inclusive 
tt-Tjets simulated sample. Since the 6-tagging probabil¬ 
ity for a 6-jet coming from top quark decay is slightly 
higher than that of a 6-jet with the same px and rj but 
arising from other sources, they are derived separately. 
The predictions agree well with the normalisation and 
shape obtained by applying the 6-tagging algorithm di¬ 
rectly. The method is applied to all signal and back¬ 
ground samples. 


6 Analysis method 

In both the single-lepton and dilepton channels, the 
analysis uses a neural network (NN) to discriminate 
signal from background in each of the regions with 
significant expected ttH signal contribution since the 
Sj^/B is very small and the uncertainty on the back¬ 
ground is larger than the signal. Those include (5j,> 
4b), (> 6j, 3b) and (> 6j, > 4b) in the case of the single¬ 
lepton channel, and (> 4j, 3b) and (> 4j, > 4b) in the 
case of the dilepton channel. In the dilepton channel, 
an additional NN is used to separate signal from back¬ 


ground in the (3j, 3b) channel. Despite a small expected 
S/VB, it nevertheless adds sensitivity to the signal due 
to a relatively high expected S/B. In the single-lepton 
channel, a dedicated NN is used in the (5j, 3b) region to 
separate tt-|-light from tt-THF backgrounds. The other 
regions considered in the analysis have lower sensitiv¬ 
ity, and use in the single-lepton channel, and the 

scalar sum of the jet and lepton px (Ht) in the dilepton 
channel as a discriminant. 


The NNs used in the analysis are built using the 
NeuroBayes 92 package. The choice of the variables 
that enter the NN discriminant is made through the 
ranking procedure implemented in this package based 
on the statistical separation power and the correlation 
of variables. Several classes of variables were consid¬ 
ered: object kinematics, global event variables, event 
shape variables and object pair properties. In the re¬ 
gions with > 6 (> 4) jets, a maximum of seven (five) 
jets are considered to construct the kinematic variables 
in the single-lepton (dilepton) channel, first using all the 
6-jets, and then incorporating the untagged jets with 
the highest px. All variables used for the NN training 
and their pairwise correlations are required to be de¬ 
scribed well in simulation in multiple control regions. 

In the (5j, 3b) region in the single-lepton channel, 
the separation between the tt-Tlight and tt-THF events 
is achieved by exploiting the different origin of the third 
6-jet in the case of tt-l-light compared to tt-|-HF events. 
In both cases, two of the 6-jets originate from the tt 
decay. However, in the case of tt-|-HF events, the third 
6-jet is likely to originate from one of the additional 
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heavy-flavour quarks, whereas in the case of tf-|-light 
events, the third 6-jet is often matched to a c-quark 
from the hadronically decaying W boson. Thus, kine¬ 
matic variables, such as the invariant mass of the two 
untagged jets with minimum AR, provide discrimina¬ 
tion between ft-Tlight and tt-THF events, since the lat¬ 
ter presents a distinct peak at the W boson mass which 
is not present in the former. This and other kinematic 
variables are used in the dedicated NN used in this re¬ 
gion. 

In addition to the kinematic variables, two variables 
calculated using the matrix element method (MEM), 
detailed in Sect. are included in the NN training in 
(> 6j, 3b) and (> 6j, > 4b) regions of the single-lepton 
channel. These two variables are the Neyman-Pearson 
likelihood ratio (Dl) (Eq. @) and the logarithm of 
the summed signal likelihoods (SSLL) (Eq. (HD- The 
Dl variable provides the best separation between tiJI 
signal and the dominant tt+bb background in the 
(^ 6j, > 4b) region. The SSLL variable further improves 
the NN performance. 

The variables used in the single-lepton and dilepton 
channels, as well as their ranking in each analysis re¬ 
gion, are listed in Tables and respectively. Eor the 
construction of variables in the (> 4j,> 4b) region of 
the dilepton channel, the two 6-jets that are closest in 
AR to the leptons are considered to originate from the 
top quarks, and the other two 6-jets are assigned to the 
Higgs candidate. 

Eigures and show the distribution of the NN 
discriminant for the ttH signal and background in the 
single-lepton and dilepton channels, respectively, in the 
signal-rich regions. In particular. Fig. j^a) shows the 
separation between the tt^-|-HF and tt-|-light-jet produc¬ 
tion achieved by a dedicated NN in the (5j, 3b) region 
in the single-lepton channel. The distributions in the 
highest-ranked input variables from each of the NN re¬ 
gions are shown in Appendix [C| 

For all analysis regions considered in the fit, the tiH 
signal includes all Higgs decay modes. They are also 
included in the NN training. 

The analysis regions have different contributions from 
various systematic uncertainties, allowing the combined 
fit to constrain them. The highly populated (4j, 2b) and 
(2j, 2b) regions in the single-lepton and dilepton chan¬ 
nels, respectively, provide a powerful constraint on the 
overall normalisation of the ti background. The (4j, 2b), 
(5j, 2b) and (> 6j, 2b) regions in the single-lepton chan¬ 
nel and the (2j, 2b), (3j, 2b) and (> 4j, 2b) regions in 
the dilepton channel are almost pure in tt-Tlight-jets 
background and provide an important constraint on ti 
modelling uncertainties both in terms of normalisation 
and shape. Uncertainties on c-tagging are reduced by 


exploiting the large contribution of IF —>■ cs decays 
in the 66-Flight-jets background populating the (4j, 3b) 
region in the single-lepton channel. Finally, the consid¬ 
eration of regions with exactly 3 and > 4 6-jets in both 
channels, having different fractions of ti+bb and ti+cc 
backgrounds, provides the ability to constrain uncer¬ 
tainties on the ti+bb and ti+cc normalisations. 


7 The matrix element method 


The matrix element method 94 has been used by the 


DO and CDF collaborations for precision measurements 
of the top quark mass 95p6 and for the observations of 
single top quark production Recently this tech¬ 

nique has been used for the tiH search by the CMS 


experiment 99 . By directly linking theoretical calcula¬ 


tions and observed quantities, it makes the most com¬ 
plete use of the kinematic information of a given event. 

The method calculates the probability density func¬ 
tion of an observed event to be consistent with physics 
process i described by a set of parameters a. This prob¬ 
ability density function Pi (a:|Q;) is defined as 


P, (a;|Q;) = exp/ ^ / dpAdpB f (pa) f (pb) 

(«) J (j) 

W {y\x) d<pN (y) 

and is obtained by numerical integration over the entire 
phase space of the initial- and final-state particles. In 
this equation, x and y represent the four-momentum 
vectors of all final-state particles at reconstruction and 
parton level, respectively. The flux factor P and the 
Lorentz-invariant phase space element d^jv describe 
the kinematics of the process. The transition matrix 
element Aii is defined by the Feynman diagrams of the 
hard process. The transfer functions W {y\x) map the 
detector quantities x to the parton level quantities y. 
Finally, the cross section normalises Pi to unity 
taking acceptance and efficiency into account. 

The assignment of reconstructed objects to final- 
state partons in the hard process contains multiple am¬ 
biguities. The process probability density is calculated 
for each allowed assignment permutation of the jets to 
the final-state quarks of the hard process. A process 
likelihood function can then be built by summing the 
process probabilities for the Np allowed assignment per¬ 
mutation, 


Vp 

Ci (a;|Q;) =^Pf(a;|a). 

p^l 


( 2 ) 
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Variable 


Definition 


Neyman-Pearson MEM discriminant (Eq. 

Scalar sum of the px divided by sum of the E for 
all jets and the lepton 

Pt of the hfth leading jet 

Second Fox-Wolfram moment computed using 

all jets and the lepton 

Average AR for all 6-tagged jet pairs 

Logarithm of the summed signal likelihoods (Eq. §) 

Mass of the combination of the two 6-tagged 
jets with the smallest AR 
Mass of the combination of a ^-tagged jet and 
any jet with the largest vector sum px 
AR between the two ^-tagged jets with the 
largest vector sum px 

AR between the lepton and the combination 
of the two 5-tagged jets with the smallest AR 
Mass of the combination of the two untagged jets 
with the smallest AR 

I. 5 A 2 , where A 2 is second eigenvalue of the 
momentum tensor 


NN rank 

> 6j,> 4b 

> 6j, 3b 

5j,> 4b 

5j, 3b 

1 

10 

- 

- 

2 

2 

1 

- 

3 

7 

- 

- 

4 

3 

2 

- 

5 

6 

5 

- 

6 

4 

- 

- 

7 

12 

4 

4 

8 

8 

- 

- 

9 

- 

- 

- 

10 

11 

10 

- 

11 

9 

- 

2 

12 

- 

8 

- 

- 

1 

3 

- 

- 

5 

- 

- 

- 

- 

6 

- 

- 

- 

7 

- 

- 

- 

9 

- 

- 

- 

- 

1 

- 

- 

- 

3 

- 

- 

- 

5 

- 

- 

- 

6 

- 

- 

- 

7 


D1 

Centrality 

Jet5 

Ft 

HI 

ARZ^ 

SSLL 

^min AR 
^bb 


max Pt 
™bj 


A TDmax Pt 

A pmin AR 


m: 


min AR 


Aplanb-jet 

Ko 


m 


m 


min AR 
bj 

max Pt 

'jj 


rrhad 

H rp 

™min AR 

max Pt 
™bb 

„min AR 
^T,uu 


^max m 


""bb 


AR 


"'jjj 


min AR 


93 


built with only 6-tagged jets 


Number of jets with px > 40 GeV 

Mass of the combination of a ^-tagged jet and 

any jet with the smallest AR 

Mass of the combination of any two jets with 

the largest vector sum px 

Scalar sum of jet px 

Mass of the combination of any two jets with 
the smallest AR 

Mass of the combination of the two 6-tagged 
jets with the largest vector sum px 
Scalar sum of the px of the pair of untagged 
jets with the smallest AR 
Mass of the combination of the two 6-tagged 
jets with the largest invariant mass 
Minimum AR between the two untagged jets 
Mass of the jet triplet with the largest vector 
sum Px 


Table 1 Single-lepton channel: the definitions and rankings of the variables considered in each of the regions where an NN is 
used. 


The process probability densities are used to distin¬ 
guish signal from background events by calculating the 
likelihood ratio of the signal and background processes 
contributing with fractions /bkg, 


fsig {x\a) = 


■^sig (i2?|Q;) 


/bkg'^bkg (a^lct) 

bkg 


(3) 


ma 


This ratio, according to the Neyman-Pearson lem- 
is the most powerful discriminant between 


signal and background processes. In the analysis, this 
variable is used as input to the NN along with other 
kinematic variables. 


Matrix element calculation methods are generated 
with Madgraph 5 in LO. The transfer functions are 
obtained from simulation following a similar procedure 


as described in Ref. 101 . For the modelling of the par- 


ton distribution functions the CTEQ6L1 set from the 


LHAPDF package 102 is used. 


The integration is performed using VEGAS 103 


Due to the complexity and high dimensionality, adap¬ 
tive MC techniques 104 , simplifications and approxi¬ 


mations are needed to obtain results within a reason¬ 
able computing time. In particular, only the numeri¬ 
cally most significant contributing helicity states of a 
process hypothesis for a given event, identified at the 
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NN rank 


Variable 

Definition 




> 4j,> 4b 

> 4j, 3b 

3j, 3b 

A max /\r) 

Maximum Arj between any two jets in the event 

1 

1 

1 


Mass of the combination of the two 6-tagged jets with 





the smallest AR 

2 

8 

“ 


Mass of the two 6-tagged jets from the Higgs candidate 




rribb 

system 

6 



AR 

AR between the Higgs candidate and the closest lepton 

4 

5 

- 


Number of Higgs candidates within 30 GeV of the Higgs 
mass of 125 GeV 

5 

2 

5 

A pinax PT 
^-^bb 

AR between the two 6-tagged jets with the largest 

6 



vector sum px 


0 

Aplanjet 

I. 5 A 2 , where A 2 is the second eigenvalue of the 
momentum tensor built with all jets 

7 

7 

- 

,™min m 

Minimum dijet mass between any two jets 

8 

3 

2 

^^max AR 

AR between the Higgs candidate and the furthest lepton 

9 

- 

- 


Dijet mass between any two jets closest to the Higgs 

10 


10 

"bj 

mass of 125 GeV 


H'j' 

Scalar sum of jet px and lepton px values 

- 

6 

3 

^^max m 

AR between the two 6-tagged jets with the largest 




invariant mass 


y 



Minimum AR between any lepton and jet 

Sum of the px divided by sum of the E for all jets and 

- 

10 

7 

Centrality 

both leptons 



max PT 

Mass of the combination of any two jets with the largest 





vector sum px 



y 

H4 

Fifth Fox-Wolfram moment computed using all jets and 
both leptons 

- 

- 

4 

jets 

Ft 

Px of the third leading jet 

- 

- 

6 


Table 2 Dilepton channel: the definitions and rankings of the variables considered in each of the regions where an NN is used. 


start of each integration, are evaluated. This does not 
perceptibly decrease the separation power but reduces 
the calculation time by more than an order of magni¬ 
tude. Furthermore, several approximations are made to 
improve the VEGAS convergence rate. Firstly, the di¬ 
mensionality of integration is reduced by assuming that 
the final-state object directions in 77 and (j) as well as 
charged lepton momenta are well measured, and there¬ 
fore the corresponding transfer functions are represented 
by 5 functions. The total momentum conservation and 
a negligible transverse momentum of the initial-state 
partons allow for further reduction. Secondly, kinematic 
transformations are utilised to optimise the integration 
over the remaining phase space by aligning the peaks 
of the integrand with the integration dimensions. The 
narrow-width approximation is applied to the leptoni- 
cally decaying W boson. This leaves three 6 -quark en¬ 
ergies, one light-quark energy, the hadronically decay¬ 
ing W boson mass and the invariant mass of the two 
6 -quarks originating from either the Higgs boson for 
the signal or a gluon for the background as the re¬ 
maining parameters which define the integration phase 


space. The total integration volume is restricted based 
upon the observed values and the width of the trans¬ 
fer functions and of the propagator peaks in the matrix 
elements. Finally, the likelihood contributions of all al¬ 
lowed assignment permutations are coarsely integrated, 
and only for the leading twelve assignment permuta¬ 
tions is the full integration performed, with a required 
precision decreasing according to their relative contri¬ 
butions. 

The signal hypothesis is defined as a SM Higgs bo¬ 
son produced in association with a top-quark pair as 
shown in Fig. [^a), (b). Hence no coupling of the Higgs 
boson to the W boson is accounted for in to 

allow for a consistent treatment when performing the 
kinematic transformation. The Higgs boson is required 
to decay into a pair of 6 -quarks, while the top-quark 
pair decays into the single-lepton channel. For the back¬ 
ground hypothesis, only the diagrams of the irreducible 
ti + 66 background are considered. Since it dominates 
the most signal-rich analysis regions, inclusion of other 
processes does not improve the separation between sig¬ 
nal and background. No gluon radiation from the final- 
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NN output 



(a) 


(b) 




(c) 


(d) 


Fig. 7 Single-tepton channel: NN output for the different regions. In the (5j, 3b) region (a), the tt+HF production is considered 
as signal and tt+light as background whereas in the (5j, > 4b) (b), (> 6j, 3b) (c), and (> 6j, > 4b) (d) regions the NN output 
is for the tiH signal and total background. The distributions are normalised to unit area. 



(a) (b) (c) 

Fig. 8 Dilepton channel: NN output for the ttH signal and total background in the (a) (3j, 3b), (b) (> 4j, 3b), and (c) 
(> 4j, > 4b) regions. The distributions are normalised to unit area. 
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state quarks is allowed, since these are kinematically 
suppressed and difficult to treat in any kinematic trans¬ 
formation aiming for phase-space alignment during the 
integration process. In the definition of the signal and 
background hypothesis the LO diagrams are required to 
have a top-quark pair as an intermediate state resulting 
in exactly four &-quarks, two light quarks, one charged 
lepton (electron or muon) and one neutrino in the h- 
nal state. Assuming lepton universality and invariance 
under charge conjugation, diagrams of only one lep¬ 
ton flavour and of only negative charge (electron) are 
considered. The probability density function calculation 
of the signal and background is only performed in the 
(> 6j, 3b) and (> 6j, > 4b) regions of the single-lepton 
channel. Only six reconstructed jets are considered in 
the calculation: the four jets with the highest value of 
the probability to be a &-jet returned by the 6-tagging 
algorithm (i.e. the highest 6-tagging weight) and two 
of the remaining jets with an invariant mass closest to 
the W boson mass of 80.4 GeV. If a jet is 6-tagged it 
cannot be assigned to a light quark in the matrix ele¬ 
ment description. In the case of more than four 6-tagged 
jets, only the four with the highest 6-tagging weight are 
treated as 6-tagged. Assignment permutations between 
the two light quarks of the hadronically decaying W 
boson and between the two 6-quarks originating from 
the Higgs boson or gluon result in the same likelihood 
value and are thus not considered. As a result there 
are in total 12 and 36 assignment permutations in the 
(> 6j,> 4b) and (> 6j, 3b) region, respectively, which 
need to be evaluated in the coarse integration phase. 

Using the ttH process as the signal hypothesis and 
the tt + 66 process as the background hypothesis, a 
slightly modified version of Eq. (§ is used to define 
the likelihood ratio Dl: 


background and/or the shape of their final discrimi¬ 
nant distributions. Individual sources of systematic un¬ 
certainty are considered uncorrelated. Correlations of 
a given systematic effect are maintained across pro¬ 
cesses and channels. Table presents a summary of 
the sources of systematic uncertainty considered in the 
analysis, indicating whether they are taken to be nor¬ 
malisation-only, shape-only, or to affect both shape and 
normalisation. In Appendix the normalisation im¬ 
pact of the systematic uncertainties are shown on the 
tt background as well as on the ttH signal. 

In order to reduce the degradation of the sensitivity 
of the search due to systematic uncertainties, they are 
fitted to data in the statistical analysis, exploiting the 
constraining power from the background-dominated re¬ 
gions described in Sect.|^ Each systematic uncertainty 
is represented by an independent parameter, referred to 
as a “nuisance parameter”, and is fitted with a Gaussian 
prior for the shape differences and a log-normal distri¬ 
bution for the normalisation. They are centred around 
zero with a width that corresponds to the given uncer¬ 
tainty. 


8.1 Luminosity 


The uncertainty on the integrated luminosity for the 
data set used in this analysis is 2.8%. It is derived fol¬ 


lowing the same methodology as that detailed in Ref. 105 
This systematic uncertainty is applied to all contribu¬ 
tions determined from the MG simulation. 


8.2 Uncertainties on physics objects 
8.2.1 Leptons 


Dl 


_ ^ttH _ 

^UH + a • ^tt+bb 


(4) 


where a = 0.23 is a relative normalisation factor chosen 
to optimise the performance of the discriminant given 
the finite bin sizes of the Dl distribution. In this defi¬ 
nition, signal-like and background-like events have Dl 
values close to one and zero, respectively. The logarithm 
of the summed signal likelihoods dehned by Eq. ([^ and 
the ratio Dl are included in the NN training in both 
the (> 6j, 3b) and (> 6j, > 4b) regions. 


8 Systematic uncertainties 

Several sources of systematic uncertainty are consid¬ 
ered that can affect the normalisation of signal and 


Uncertainties associated with the lepton selection arise 
from the trigger, reconstruction, identification, isolation 
and lepton momentum scale and resolution. In total, 
uncertainties associated with electrons (muons) include 
five (six) components. 


8.2.2 Jets 


Uncertainties associated with the jet selection arise from 
the jet energy scale (JES), jet vertex fraction require¬ 
ment, jet energy resolution and jet reconstruction effi¬ 
ciency. Among these, the JES uncertainty has the largest 
impact on the analysis. The JES and its uncertainty are 
derived combining information from test-beam data, 


LHC collision data and simulation 35 . The jet energy 


scale uncertainty is split into 22 uncorrelated sources 
which can have different jet px and ry dependencies. In 
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Systematic uncertainty 

Type 

Comp. 

Luminosity 

N 

1 

Physics Objects 

Electron 

SN 

5 

Muon 

SN 

6 

Jet energy scale 

SN 

22 

Jet vertex fraction 

SN 

1 

Jet energy resolution 

SN 

1 

Jet reconstruction 

SN 

1 

6-tagging efficiency 

SN 

6 

c-tagging efficiency 

SN 

4 

Light-jet tagging efficiency 

SN 

12 

High-px tagging efficiency 

SN 

1 

Background Model 

ti cross section 

N 

1 

ti modelling: px reweighting 

SN 

9 

tt modelling: parton shower 

SN 

3 

66-|-heavy-flavour: normalisation 

N 

2 

ti-\-cc: Px reweighting 

SN 

2 

tt-\-cc: generator 

SN 

4 

ti-\-bb: NLO Shape 

SN 

8 

lU-bjets normalisation 

N 

3 

W Px reweighting 

SN 

1 

Z-bjets normalisation 

N 

3 

Z Px reweighting 

SN 

1 

Lepton misID normalisation 

N 

3 

Lepton misID shape 

S 

3 

Single top cross section 

N 

1 

Single top model 

SN 

1 

Diboson-bjets normalisation 

N 

3 

ti-\-V cross section 

N 

1 

ti -\-V model 

SN 

1 

Signal Model 

tiH scale 

SN 

2 

tiH generator 

SN 

1 

tiH hadronisation 

SN 

1 

tiH PDF 

SN 

1 


Table 3 List of systematic uncertainties considered. An “N” 
means that the uncertainty is taken as normalisation-only for 
all processes and channels affected, whereas an “S” denotes 
systematic uncertainties that are considered shape-only in all 
processes and channels. An “SN” means that the uncertainty 
is taken on both shape and normalisation. Some of the sys¬ 
tematic uncertainties are split into several components for a 
more accurate treatment. This is the number indicated in the 
column labelled as “Comp.”. 


this analysis, the largest jet energy scale uncertainty 
arises from the rj dependence of the JES calibration in 
the end-cap regions of the calorimeter. It is the second 
leading uncertainty. 


8.2.3 Heavy- and light-flavour tagging 


A total of six (four) independent sources of uncertainty 


affecting the &(c)-tagging efficiency are considered 37 


Each of these uncertainties corresponds to an eigenvec¬ 
tor resulting from diagonalising the matrix containing 
the information about the total uncertainty per jet px 
bin and the bin-to-bin correlations. An additional un¬ 
certainty is assigned due to the extrapolation of the b- 
tagging efficiency measurement to the high-px region. 
Twelve uncertainties are considered for the light-jet tag¬ 
ging and they depend on jet px and r]. These systematic 
uncertainties are taken as uncorrelated between 6-jets, 
c-jets, and light-flavour jets. 

No additional systematic uncertainty is assigned due 
to the use of parameterisations of the 6-tagging proba¬ 
bilities instead of applying the 6-tagging algorithm di¬ 
rectly since the difference between these two approaches 
is negligible compared to the other sources. 


8.3 Uncertainties on background modelling 
8.3.1 tt-hjets modelling 


An uncertainty of -|-6.5%/-6% is assumed for the in¬ 
clusive tt production cross section. It includes uncer¬ 
tainties from the top quark mass and choices of the 
PDF and as- The PDF and as uncertainties are calcu¬ 
lated using the PDF4LHC prescription 106 with the 
MSTW2008 68% CL NNLO, CTIO NNLO and 
NNPDF2.3 5f FEN 108 PDF sets, and are added in 
quadrature to the scale uncertainty. Other systematic 
uncertainties affecting the modelling of tf-|-jets include 
uncertainties due to the choice of parton shower and 
hadronisation model, as well as several uncertainties re¬ 
lated to the reweighting procedure applied to improve 
the ti MC model. Additional uncertainties are assigned 
to account for limited knowledge of tt-THF jets produc¬ 
tion. They are described later in this section. 

As discussed in Sect. to improve the agreement 
between data and the tt simulation a reweighting proce¬ 
dure is applied to ti MC events based on the difference 
in the top quark px and ti system px distributions be¬ 
tween data and simulation at ^/s = 7 TeV 57 . The 
nine largest uncertainties associated with the experi¬ 
mental measurement of top quark and ti system px, 
representing approximately 95% of the total experimen¬ 
tal uncertainty on the measurement, are considered as 
separate uncertainty sources in the reweighting applied 
to the MC prediction. The largest uncertainties on the 
measurement of the differential distributions include ra¬ 
diation modelling in ti events, the choice of generator 
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to simulate tt production, uncertainties on the compo¬ 
nents of jet energy scale and resolution, and flavour 
tagging. 

Because the measurement is performed for the in¬ 
clusive ti sample and the size of the uncertainties appli¬ 
cable to the tt+cc component is not known, two addi¬ 
tional uncorrelated uncertainties are assigned to tt+cc 
events, consisting of the full difference between apply¬ 
ing and not applying the reweightings of the ti system 
Pt and top quark px) respectively. 

An uncertainty due to the choice of parton shower 
and hadronisation model is derived by comparing events 
produced by Powheg interfaced with Pythia or Her- 
WIG. Effects on the shapes are compared, symmetrised 
and applied to the shapes predicted by the default model. 
Given that the change of the parton shower model leads 
to two separate effects - a change in the number of jets 
and a change of the heavy-flavour content - the parton 
shower uncertainty is represented by three parameters, 
one acting on the tt-flight contribution and two oth¬ 
ers on the ti+cc and ti+bb contributions. These three 
parameters are treated as uncorrelated in the fit. 

Detailed comparisons of ti+bb production between 
Powheg-I-Pythia and an NLO prediction of ti+bb pro¬ 
duction based on SherpaOL have shown that the cross 
sections agree within 50% of each other. Therefore, a 
systematic uncertainty of 50% is applied to the ti + bb 
component of the ti+}ets background obtained from the 
Powheg-|-Pythia MC simulation. In the absence of 
an NLO prediction for the ti+cc background, the same 
50% systematic uncertainty is applied to the ti+cc com¬ 
ponent, and the uncertainties on ti + bb and ti+cc are 
treated as uncorrelated. The large available data sample 
allows the determination of the ti+bb and ti+cc normal¬ 
isations with much better precision, approximately 15% 
and 30%, respectively (see Appendix [Pj). Thus, the final 
result does not significantly depend on the exact value 
of the assumed prior uncertainty, as long as it is larger 
than the precision with which the data can constrain 
it. However, even after the reduction, the uncertainties 
on the ti+bb and the ti+cc background normalisation 
are still the leading and the third leading uncertainty 
in the analysis, respectively. 

Four additional systematic uncertainties in the ti+cc 
background estimate are derived from the simultaneous 
variation of factorisation and renormalisation scales, 
matching threshold and c-quark mass variations in the 
Madgraph-|-Pythia ti simulation, and the difference 
between the ti+cc simulation in Madgraph-|-Pythia 
and Powheg-I-Pythia since Madgraph-|-Pythia in¬ 
cludes the ti+cc process in the matrix element calcula¬ 
tion while it is absent in Powheg-|-Pythia. 


For the ti+bb background, three scale uncertainties, 
including changing the functional form of the renormal¬ 
isation scale to /iR = changing the func¬ 

tional form of the factorisation /xp and resummation 
fiQ scales to /ip = /xq = Ili^t tbb varying the 

renormalisation scale /xr by a factor of two up and down 
are evaluated. Additionally, the shower recoil model un¬ 
certainty and two uncertainties due to the PDF choice 
in the SherpaOL NLO calculation are quoted. The ef¬ 
fect of these variations on the contribution of different 
ti+bb event categories is shown in Fig.|^ The renormal¬ 
isation scale choice and the shower recoil scheme have 
a large effect on the modelling of ti + bb. They provide 
large shape variations of the NN discriminants result¬ 
ing in the fourth and sixth leading uncertainties in this 
analysis. 

Finally, two uncertainties due to ti+bb production 
via multiparton interaction and final-state radiation which 
are not present in the SherpaOL NLO calculation are 
applied. Overall, the uncertainties on ti + bb normali¬ 
sation and modelling result in about a 55% total un¬ 
certainty on the ti + bb background contribution in the 
most sensitive (> 6j, > 4b) and (> 4j, > 4b) regions. 

8.3.2 The W/Z+jets modelling 

As discussed in Sect. the IT/Z-l-jets contributions 
are obtained from the simulation and normalised to the 
inclusive theoretical cross sections, and a reweighting is 
applied to improve the modelling of the W/Z boson 
Pt spectrum. The full difference between applying and 
not applying the WjZ boson px reweighting is taken as 
a systematic uncertainty, which is then assumed to be 
symmetric with respect to the central value. Additional 
uncertainties are assigned due to the extrapolation of 
the W/Z+]ets estimate to high jet multiplicity. 


8.3.3 Misidentified lepton background modelling 


Systematic uncertainties on the misidentified lepton back¬ 
ground estimated via the matrix method 38 in the 


single-lepton channel receive contributions from the lim¬ 
ited number of data events, particularly at high jet and 
5-tag multiplicities, from the subtraction of the prompt- 
lepton contribution as well as from the uncertainty on 
the lepton misidentification rates, estimated in differ¬ 
ent control regions. The statistical uncertainty is un¬ 
correlated among the different jet and 6-tag multiplicity 
bins. An uncertainty of 50% associated with the lepton 
misidentification rate measurements is assumed, which 
is taken as correlated across jet and 6-tag multiplic¬ 
ity bins, but uncorrelated between electron and muon 
channels. Uncertainty on the shape of the misidentified 
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(a) (b) 

Fig. 9 Systematic uncertainties on the tt+bb contribution based on (a) scale variations and (b) PDF choice and shower recoil 
model of the SherpaOL simulation. The effect of a given systematic uncertainty is shown across the different tt+bb categories. 
The effect of migration between categories is covered by variations of these systematic uncertainties. 


lepton background arises from the prompt-lepton back¬ 
ground subtraction and the misidentified lepton rate 
measurement. 

In the dilepton channel, since the misidentified lep¬ 
ton background is estimated using both the simulation 
and same-sign dilepton events in data, a 50% normal¬ 
isation uncertainty is assigned to cover the maximum 
difference between the two methods. It is taken as corre¬ 
lated among the different jet and 5-tag multiplicity bins. 
An additional uncertainty is applied to cover the differ¬ 
ence in shape between the predictions derived from the 
simulation and from same-sign dilepton events in data. 


Finally, an uncertainty of ±30% is assumed for the 
theoretical cross sections of the tt + V 70 71 back¬ 
ground. An additional uncertainty on tt + V modelling 
arises from variations in the amount of initial-state ra¬ 
diation. The ti+ Z background with Z boson decaying 
into a 55 pair is an irreducible background to the ttH^ 
H —>■ bb signal, and as such, has kinematics and an NN 
discriminant shape similar to those of the signal. The 
uncertainty on the ti +V background normalisation is 
the fifth leading uncertainty in the analysis. 


8.4 Uncertainties on signal modelling 


8.3.4 Electroweak background modelling 


Uncertainties of ±5%/-4% and ±6.8% are used for the 
theoretical cross sections of single top production in 
the single-lepton and dilepton channels 64 65 , respec¬ 


tively. The former corresponds to the weighted average 
of the theoretical uncertainties on s-, t- and lUt-channel 
production, while the latter corresponds to the theoreti¬ 
cal uncertainty on lUt-channel production, the only sin¬ 
gle top process contributing to the dilepton final state. 

The uncertainty on the diboson background rates 
includes an uncertainty on the inclusive diboson NLO 


cross section of ±5% 62 and uncertainties to account 


for the extrapolation to high jet multiplicity. 


Dedicated NLO PowHel samples are used to evaluate 
the impact of the choice of factorisation and renormal¬ 
isation scales on the ttH signal kinematics. In these 
samples the default scale is varied by a factor of two 
up and down. The effect of the variations on ttH dis¬ 
tributions was studied at particle level and the nomi¬ 
nal PowHel ttH sample was reweighted to reproduce 
these variations. In a similar way, the nominal sample 
is reweighted to reproduce the effect of changing the 
functional form of the scale. Additional uncertainties 
on the ttH signal due to the choice of PDF, parton 
shower and fragmentation model and NLO generator 
are also considered. The effect of the PDF uncertainty 
on the ttH signal is evaluated following the recommen- 
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dation of the PDF4LHC. The uncertainty in the par- 
ton shower and fragmentation is evaluated by compar¬ 
ing Powhel-|-Pythia8 and Powhel-|-Herwig sam¬ 
ples, while the uncertainty due to a generator choice is 
evaluated by comparing Powhel-|-Pythia8 with 
Madgraph5_aMC@NLO |109| interfaced with 


Herwig-I—I- 110 111 


9 Statistical methods 


The distributions of the discriminants from each of the 
channels and regions considered are combined to test for 
the presence of a signal, assuming a Higgs boson mass 
of niH = 125 GeV. The statistical analysis is based on 
a binned likelihood function £(^, 0 ) constructed as a 
product of Poisson probability terms over all bins con¬ 
sidered in the analysis. The likelihood function depends 
on the signal-strength parameter /r, defined as the ratio 
of the observed/expected cross section to the SM cross 
section, and 9, denoting the set of nuisance parame¬ 
ters that encode the effects of systematic uncertain¬ 
ties on the signal and background expectations. They 
are implemented in the likelihood function as Gaus¬ 
sian or log-normal priors. Therefore, the total number 
of expected events in a given bin depends on /r and 9. 
The nuisance parameters 9 adjust the expectations for 
signal and background according to the corresponding 
systematic uncertainties, and their fitted values corre¬ 
spond to the amount that best fits the data. This pro¬ 
cedure allows the impact of systematic uncertainties 
on the search sensitivity to be reduced by taking ad¬ 
vantage of the highly populated background-dominated 
control regions included in the likelihood fit. It requires 
a good understanding of the systematic effects affect¬ 
ing the shapes of the discriminant distributions. The 
test statistic is defined as the profile likelihood ra¬ 
tio: qf_i = —21n(£(^, 9f^)/£{fi, 9)), where fi and 9 are the 
values of the parameters that maximise the likelihood 


function (with the constraints 0 < /t < ^), and 0^ are 
the values of the nuisance parameters that maximise 
the likelihood function for a given value of fj,. This test 
statistic is used to measure the compatibility of the ob¬ 
served data with the background-only hypothesis (i.e. 
for n = 0), and to make statistical inferences about /i, 
such as upper limits using the GLg method 112-114 


as implemented in the RooFix package 115 116 


To obtain the final result, a simultaneous fit to the 
data is performed on the distributions of the discrimi¬ 
nants in 15 regions: nine analysis regions in the single¬ 
lepton channel and six regions in the dilepton channel. 
Fits are performed under the signal-plus-background 
hypothesis, where the signal-strength parameter fj, is 


the parameter of interest in the fit and is allowed to 
float freely, but is required to be the same in all 15 fit 
regions. The normalisation of each background is de¬ 
termined from the fit simultaneously with Contribu¬ 
tions from ti, IF/Z-l-jets production, single top, diboson 
and ttV backgrounds are constrained by the uncertain¬ 
ties of the respective theoretical calculations, the un¬ 
certainty on the luminosity, and the data themselves. 
Statistical uncertainties in each bin of the discriminant 
distributions are taken into account by dedicated pa¬ 
rameters in the fit. The performance of the fit is tested 
using simulated events by injecting ttH signal with a 
variable signal strength and comparing it to the fitted 
value. Good agreement between the injected and mea¬ 
sured signal strength is observed. 


10 Results 

The results of the binned likelihood fit to data described 
in Sect. are presented in this section. Figure [T0| shows 
the yields after the fit in all analysis regions in the 
single-lepton and dilepton channels. The post-fit event 
yields and the corresponding S/B and Sj'/B ratios are 
summarised in Appendix [E| 

Figures [TTHT^ and [T4f[T5| show a comparison of data 
and prediction for the discriminating variables (either 
Ht, or NN discriminants) for each of the regions 
considered in the single-lepton and dilepton channels, 
respectively, both pre- and post-fit to data. The un¬ 
certainties decrease significantly in all regions due to 
constraints provided by data and correlations between 
different sources of uncertainty introduced by the fit to 
the data. In Appendix]^ the most highly discriminat¬ 
ing variables in the NN are shown post-fit compared to 
data. 

Table 1^ shows the observed p, values obtained from 
the individual fits in the single-lepton and dilepton chan¬ 
nels, and their combination. The signal strength from 
the combined fit for rriH = 125 GeV is: 

^(wff = 125 GeV) = 1.5± 1.1. (5) 

The expected uncertainty for the signal strength (/i = 1) 
is ±1.1. The observed (expected) significance of the sig¬ 
nal is 1.4 (1.1) standard deviations, which corresponds 
to an observed (expected) p-value of 8% (15%). The 
probability, p, to obtain a result at least as signal-like 
as observed if no signal is present is calculated using 
qo = —2\n{C{0,9^)/£{fi, 9)) as a test statistic. 

The fitted values of the signal strength and their 
uncertainties for the individual channels and their com¬ 
bination are shown in Fig. |16[ 
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(a) (b) 

Fig. 10 Event yields in all analysis regions in (a) the single-lepton channel and (b) the dilepton channel after the combined 
fit to data under the signal-plus-background hypothesis. The signal, normalised to the fitted /r, is shown both as a filled area 
stacked on the other backgrounds and separately as a dashed line. The hashed area represents the total uncertainty on the 
yields. 


Signal strength 


Uncertainty 

Single lepton 

1.2 

1.3 

Dilepton 

2.8 

2.0 

Combination 

1.5 

1.1 


Table 4 The fitted values of signal strength and their un¬ 
certainties for the individual channels as well as their com¬ 
bination, assuming mu = 125 GeV. Total uncertainties are 
shown. 


The observed limits, those expected with and with¬ 
out assuming a SM Higgs boson with mn = 125 GeV, 
for each channel and their combination are shown in 
Fig. [l^ A signal 3.4 times larger than predicted by the 
SM is excluded at 95% CL using the CLg method. A 
signal 2.2 times larger than for the SM Higgs boson is 
expected to be excluded in the case of no SM Higgs 
boson, and 3.1 times larger in the case of a SM Higgs 
boson. This is also summarised in Table [Sl 


Figure [T^ summarises post-fit event yields as a func¬ 
tion of log^g(S'/H), for all bins of the distributions used 
in the combined fit of the single-lepton and dilepton 
channels. The value of logio(S'/H) is calculated accord¬ 
ing to the post-fit yields in each bin of the fitted dis¬ 
tributions, either Ht, or NN. The total number 

of background and signal events is displayed in bins of 


\ogiQ{S/B). In particular, the last bin of Fig. 18 


eludes the two last bins from the most signal-rich re¬ 
gion of the NN distribution in (> 6j, > 4b) and the two 


last bins from the most signal-rich region of the NN in 
(> 4j,> 4b) from the fit. The signal is normalised to 
the fitted value of the signal strength (/r = 1.5) and 
the background is obtained from the global fit. A sig¬ 
nal strength 3.4 times larger than predicted by the SM, 
which is excluded at 95% CL by this analysis, is also 
shown. 

Figure [^demonstrates the effect of various system¬ 
atic uncertainties on the fitted value of ^ and the con¬ 
straints provided by the data. The post-fit effect on /x 
is calculated by fixing the corresponding nuisance pa¬ 
rameter at 0 ± (Tg, where 9 is the fitted value of the 
nuisance parameter and ag is its post-fit uncertainty, 
and performing the fit again. The difference between 
the default and the modified /i, Z\/i, represents the ef¬ 
fect on fi of this particular systematic uncertainty. The 
largest effect arises from the uncertainty in normali¬ 
sation of the irreducible ti + bb background. This un¬ 
certainty is reduced by more than one half from the 
initial 50%. The tt + bb background normalisation is 
pulled up by about 40% in the fit, resulting in an in¬ 
crease in the observed tt + bb yield with respect to the 
Powheg+Pythia prediction. Most of the reduction 
in uncertainty on the tt + bb normalisation is the result 
of the significant number of data events in the signal- 
rich regions dominated by ti + bb background. With no 
Gaussian prior considered on the ti + bb normalisation, 
as described in Sect. [^ the fit still prefers an increase 
in the amount of ti + bb background by about 40%. 
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Fig. 11 Single-lepton channel: comparison between data and prediction for the discriminant variable used in the (4j, 2b) 
region (a) before the fit and (b) after the fit, in the (4j, 3b) region (c) before the fit and (d) after the fit, in the (4j, 4b) region 
(e) before the fit and (f) after the fit. The fit is performed on data under the signal-plus-background hypothesis. The last bin 
in all figures contains the overflow. The bottom panel displays the ratio of data to the total prediction. An arrow indicates that 
the point is off-scale. The hashed area represents the uncertainty on the background. The tlH signal yield (solid) is normalised 
to the SM cross section before the fit and to the fitted fi after the fit. In several regions, predominantly the control regions, 
the ttH signal yield is not visible on top of the large background. 
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Fig. 12 Single-lepton channel: comparison of data and prediction for the discriminant variable used in the (5j, 2b) region 
(a) before the fit and (b) after the fit, in the (5j, 3b) region (c) before the fit and (d) after the fit, in the (5j,> 4b) region 
(e) before the fit and (f) after the fit. The fit is peformed on data under the signal-plus-background hypothesis. The last bin 
in all figures contains the overflow. The bottom panel displays the ratio of data to the total prediction. An arrow indicates 
that the point is off-scale. The hashed area represents the uncertainty on the background. The dashed line shows ttH signal 
distribution normalised to background yield. The ttH signal yield (solid) is normalised to the SM cross section before the fit 
and to the fitted fj, after the fit. In several regions, predominantly the control regions, the tiH signal yield is not visible on top 
of the large background. 
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Fig. 13 Single-lepton channel: comparison of data and prediction for the discriminant variable used in the (> 6j, 2b) region 
(a) before the fit and (b) after the fit, in the (> 6j, 3b) region (c) before the fit and (d) after the fit, in the (> 6j, > 4b) region 
(e) before the fit and (f) after the fit. The fit is performed on data under the signal-plus-background hypothesis. The last bin 
in all figures contains the overflow. The bottom panel displays the ratio of data to the total prediction. An arrow indicates 
that the point is off-scale. The hashed area represents the uncertainty on the background. The dashed line shows tiH signal 
distribution normalised to background yield. The ttH signal yield (solid) is normalised to the SM cross section before the fit 
and to the fitted fj, after the fit. In several regions, predominantly the control regions, the tiH signal yield is not visible on top 
of the large background. 
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Fig. 14 Dilepton channel: comparison of data and prediction for the discriminant variable used in the (2j, 2b) region (a) 
before the ht and (b) after the fit, in the (3j, 2b) region (c) before the fit and (d) after the fit, in the (3j, 3b) region (e) before 
the fit and (f) after the fit. The fit is performed on data under the signal-plus-background hypothesis. The last bin in all figures 
contains the overflow. The bottom panel displays the ratio of data to the total prediction. An arrow indicates that the point 
is off-scale. The hashed area represents the uncertainty on the background. The dashed line shows tiH signal distribution 
normalised to background yield. The ttH signal yield (solid ) is normalised to the SM cross section before the fit and to the 
fitted n after the fit. In several regions, predominantly the control regions, the tiH signal yield is not visible on top of the large 
background. 
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Fig. 15 Dilepton channel: comparison of data and prediction for the discriminant variable used in the (> 4j, 2b) region (a) 
before the fit and (b) after the ht, in the (> 4j, 3b) region (c) before the fit and (d) after the ht, in the (> 4j, > 4b) region 
(e) before the fit and (f) after the fit. The fit is performed on data under the signal-plus-background hypothesis. The last bin 
in all figures contains the overflow. The bottom panel displays the ratio of data to the total prediction. An arrow indicates 
that the point is off-scale. The hashed area represents the uncertainty on the background. The dashed line shows tin signal 
distribution normalised to background yield. The tin signal yield (solid) is normalised to the SM cross section before the fit 
and to the fitted fj, after the fit. In several regions, predominantly the control regions, the tin signal yield is not visible on top 
of the large background. 
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95% CL upper limit 

Observed 

-2(7 

— 1(7 

Median 

+ lcj- 

-{-‘ 2(7 

Median [fj, = 1) 

Single lepton 

3.6 

1.4 

1.9 

2.6 

3.7 

4.9 

3.6 

Dilepton 

6.7 

2.2 

3.0 

4.1 

5.8 

7.7 

4.7 

Combination 

3.4 

1.2 

1.6 

2.2 

3.0 

4.1 

3.1 


Table 5 Observed and expected (median, for the background-only hypothesis) 95% CL upper limits on a{ttH) relative to 
the SM prediction, for the individual channels as well as their combination, assuming rriH = 125 GeV. The 68% and 95% 
confidence intervals around the expected limits under the background-only hypothesis are also provided, denoted by ilcr and 
±20-, respectively. The expected (median) 95% CL upper limits assuming the SM prediction for cr[ttH) are shown in the last 
column. 
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Fig. 16 The fitted values of the signal strength and their un¬ 
certainties for the individual channels and their combination. 
The green line shows the statistical uncertainty on the signal 
strength. 
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Fig. 17 95% CL upper limits on cF{ttH) relative to the SM 
prediction, tr/crsM, for the individual channels as well as their 
combination. The observed limits (solid lines) are compared 
to the expected (median) limits under the background-only 
hypothesis and under the signal-plus-background hypothesis 
assuming the SM prediction for cr{ttH) and pre-fit predic¬ 
tion for the background. The surrounding shaded bands cor¬ 
respond to the 68% and 95% confidence intervals around the 
expected limits under the background-only hypothesis, de¬ 
noted by ±1(7 and ±2(7, respectively. 
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Fig. 18 Event yields as a function of logj^Q(S/i?), where S 
(signal yield) and B (background yield) are taken from the 
Ht, and NN output bin of each event. Events in all 
fitted regions are included. The predicted background is ob¬ 
tained from the global signal-plus-background fit. The ttH 
signal is shown both for the best fit value {fi = 1.5) and for 
the upper limit at 95% CL (/r = 3.4). 


The tt ± bb modelling uncertainties affecting the 
shape of this background also have a significant effect on 
/X. These systematic uncertainties affect only the tt + bb 
modelling and are not correlated with the other tt±jets 
backgrounds. The largest of the uncertainties is given 
by the renormalisation scale choice. The uncertainty 
drastically changes the shape of the NN for the ti ± bb 
background, making it appear more signal-like. 

The tt+cc normalisation uncertainty is ranked third 
(Fig.[^ and its pull is slightly negative, while the post- 
fit yields for ti+cc increase significantly in the four- and 
five-jet regions in the single-lepton channel and in the 
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Y/Z/ZA Post-fit Impact on p 


Fig. 19 The fitted values of 
the nuisance parameters with the 
largest impact on the measured sig¬ 
nal strength. The points, which are 
drawn conforming to the scale of the 
bottom axis, show the deviation of 
each of the fitted nuisance parame¬ 
ters, 6, from do, which is the nom¬ 
inal value of that nuisance parame¬ 
ter, in units of the pre-fit standard 
deviation Ad. The error bars show 
the post-fit uncertainties, ere, which 
are close to 1 if the data do not pro¬ 
vide any further constraint on that 
uncertainty. Conversely, a value of 
cro much smaller than 1 indicates 
a significant reduction with respect 
to the original uncertainty. The nui¬ 
sance parameters are sorted accord¬ 
ing to the post-fit effect of each on /r 
(hashed blue area) conforming to the 
scale of the top axis, with those with 
the largest impact at the top. 


two- and three-jet regions of the dilepton channel (see 
Tables 10 and 11 of Appendix . It was verified that 
this effect is caused by the interplay between the tt + cc 
normalisation uncertainty and several other systematic 
uncertainties affecting the tt cc background yield. 

The noticeable effect of the light-jet tagging (mistag) 
systematic uncertainty is explained by the relatively 
large fraction of the tt-l-light background in the sig¬ 
nal region with four &-jets in the single-lepton channel. 
The tt-l-light events enter the 4-5-tag region through a 
mistag as opposed to the 3-5-tag region where tagging 
a c-jet from a W boson decay is more likely. Since the 
amount of data in the 4-5-tag regions is not large this 
uncertainty cannot be constrained significantly. 


The tt + Z background with Z —55 is an irre¬ 
ducible background to the ttH signal as it has the same 
number of 5-jets in the final state and similar event 
kinematics. Its normalisation has a notable effect on fi 
{dfi/da{ttV) = 0.3) and the uncertainty arising from 
the tt + V normalisation cannot be significantly con¬ 
strained by the fit. Other leading uncertainties include 
5-tagging and some components of the JES uncertainty. 

Uncertainties arising from jet energy resolution, jet 
vertex fraction, jet reconstruction and JES that affect 
primarily low px jets as well as the tt-l-light-jet back¬ 
ground modelling uncertainties are constrained mainly 
in the signal-depleted regions. These uncertainties do 
not have a significant effect on the fitted value of p. 
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11 Summary 

A search has been performed for the Standard Model 
Higgs boson produced in association with a top-quark 
pair {tiH) using 20.3 fb“^ of pp collision data at -y/s = 

8 TeV collected with the ATLAS detector during the 
first run of the Large Hadron Collider. The search fo¬ 
cuses on H ^ bb decays, and is performed in events 
with either one or two charged leptons. 

To improve sensitivity, the search employs a likeli¬ 
hood fit to data in several jet and ^-tagged jet multi¬ 
plicity regions. Systematic uncertainties included in the 
fit are significantly constrained by the data. Discrimina¬ 
tion between signal and background is obtained in both 
final states by employing neural networks in the signal- 
rich regions. In the single-lepton channel, discriminat¬ 
ing variables are calculated using the matrix element 
technique. They are used in addition to kinematic vari¬ 
ables as input to the neural network. No significant 
excess of events above the background expectation is 
found for a Standard Model Higgs boson with a mass 
of 125 GeV. An observed (expected) 95% confidence- 
level upper limit of 3.4 (2.2) times the Standard Model 
cross section is obtained. By performing a fit under 
the signal-plus-background hypothesis, the ratio of the 
measured signal strength to the Standard Model expec¬ 
tation is found to be /J, = 1.5 ± 1.1. 
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A Higgs boson decay modes 

Figureshows the contributions of different Higgs boson de¬ 
cay modes in each of the analysis regions in the single-lepton 
and dilepton channels. The H ^ bb decay is the dominant 
contribution in the signal-rich regions. 


channel, respectively. All of the plots are made using post-fit 
predictions. 


B Event yields prior to the fit 

The event yields prior to the fit for the combined e-|-jets 
and /i-|-jets samples for the different regions considered in 
the analysis are summarised in Table 

The event yields prior to the fit for the combined ee-fjets, 
/r/i-|-jets and e/i-|-jets samples for the different regions consid¬ 
ered in the dilepton channel are summarised in Table 


C Discrimination power of input variables 

Figures |21f[24| and |25H27| show the discrimination between 
signal and background for the top four input variables in 
each region where NN is used in the single-lepton and dilep¬ 
ton channels, respectively. In Fig. |21[ the NN is designed to 
separate ft-|-HF from ff-f-light. 


D Tables of systematic uncertainties in the 
signal region 

Tables and 1^ show pre-fit and post-fit contributions of the 
different categories of uncertainties (expressed in %) for the 
ttH signal and main background processes in the (> 6j, > 4b) 
region of the single-lepton channel and the (> 4j, > 4b) region 
of the dilepton channel, respectively. 

The “Lepton efficiency” category includes sysmematic un- 
certanties on electrons and muons listed in Table The 
“Jet efficiency” category includes uncertainties on the jet 
vertex fraction and jet reconstruction. The “tt heavy-flavour 
modelling” category includes uncertainties on the ti+bb NLO 
shape and on the tf+cc pt reweighting and generator. The 
“Theoretical cross sections” category includes uncertainties 
on the single top, diboson, T-|-jets and tt V theoretical 
cross sections. The “ttH modelling” category includes contri¬ 
butions from ttH scale, generator, hadronisation model and 
PDF choice. The details of the evaluation of the uncertainties 
can be found in Sect, [s] 


E Post-fit event yields 

The post-fit event yields for the combined single-lepton chan¬ 
nel for the different regions considered in the analysis are 
summarised in Table |10[ Similarly, the post-fit event yields 
for the combined dilepton channels for the different regions 
are summarised in Table m 


F Post-fit input variables 

Figures |28| - [31| and |32| - [34| show a comparison of data and 
prediction for the top four input variables in each region with 
a neural network in the single-lepton channel and dilepton 



32 



4j,2b 

4 j, 3 b 

4j,4b 

tin (125) 

31 ± 3 

13 ± 2 

2.0±0.3 

tt-\- light 

77 000 ±7500 

6200 ± 750 

53 ± 12 

ti -b cc 

4900 ± 3000 

680± 390 

21 ± 12 

ti + bb 

1800 ±1100 

680± 380 

44 ± 25 

Mb-(-jets 

5100 ±3000 

220± 130 

5.5 ±3.3 

Z-l-jets 

1100 ± 600 

50 ± 27 

0.9±0.6 

Single top 

4900 ± 640 

340 ± 60 

6.8±1.6 

Diboson 

220 ± 71 

11± 4.1 

0.2±0.1 

ti+V 

120± 40 

15± 5.1 

0.9±0.3 

Lepton misID 

1600 ± 620 

100± 37 

3.5±1.3 

Total 

96 000 ±9500 

8300 ±1100 

140± 34 

Data 

98 049 

8752 

161 

S/B 

< 0.001 

0.002 

0.014 

s/Vb 

0.099 

0.141 

0.167 



5j, 2 b 

5 j, 3 b 

5j, > 4b 

tiH (125) 

41 ± 2 

23 ± 2 

6.2 ± 0.8 

tt+ light 

38 000 ±5500 

3500 ±520 

61 ± 15 

tt + cc 

4300 ± 2400 

810 ±460 

43 ± 25 

tt + bb 

1700 ± 880 

890 ±480 

110± 63 

TT+jets 

1900 ± 1200 

140± 87 

5.9± 3.9 

Z-(-jets 

410± 240 

29 ± 17 

1.5± 0.9 

Single top 

1900 ± 360 

190± 41 

8.3± 1.3 

Diboson 

97 ± 39 

8.0 ± 3.4 

0.4± 0.2 

ti+V 

150± 48 

26 ± 9 

3.1 ± 1.0 

Lepton misID 

460± 170 

70 ± 28 

8.3± 3.7 

Total 

49 000 ±7000 

5700 ± 980 

250 ± 75 

Data 

49 699 

6199 

286 

S/B 

0.001 

0.004 

0.025 

s/Vb 

0.186 

0.301 

0.397 



> 6 j, 2 

b 

> 6 j,; 

B b 

IV 

C_1. 

IV 

4 b 

ttH (125) 

64 ± 

5 

40 ± 

3 

16 ± 

2 

ti+ light 

19 000 ±4400 

2000 ± 

460 

52 ± 

17 

ti ± cc 

3700 ±2100 

850 ± 

480 

79 ± 

46 

ti ± bb 

1400 ± 

770 

970 ± 

530 

250 ± 

130 

lT±jets 

910± 

620 

97 ± 

66 

8.6 ± 

6.2 

Z±jets 

180 ± 

120 

19 ± 

12 

1.5± 

1.0 

Single top 

840 ± 

220 

120 ± 

35 

12 ± 

3.7 

Diboson 

50 ± 

24 

6.0 ± 

3.0 

0.5 ± 

0.3 

ti+V 

180 ± 

59 

45 ± 

14 

8.5 ± 

2.8 

Lepton misID 

180 ± 

66 

21 ± 

8 

1.1± 

0.5 

Total 

26 000 ±5800 

4200 ±1000 

430 ± 

150 

Data 

26185 


4701 


516 


S/B 

0.002 


0.01 


0.04 


s/Vb 

0.393 


0.63 


0.815 



Table 6 Single lepton channel: pre-fit event yields for signal, backgrounds and data in each of the analysis regions. The quoted 
uncertainties are the sum in quadrature of the statistical and systematic uncertainties on the yields. 
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# 




4j, 4 b 



ATLAS 

Simulation 

mH=125 GeV 
= 8 TeV 




ATLAS 

Simulation 

mH = 125 GeV 
\fs = 8 TeV 


5j, 2b 5j,3b 5j,>4b BtfH, H^bb 3 j, 2 b 3 j, 3 b 

OttH, H^WW 

atfH.H^gg 

nun, H->others \,_J^ 


■ ttH, H^bb 
□ tfH, H^WW 

■ tfH, 


□ tfH, H^gg 

□ tfH, H->others 




• Single lepton 

® ® w 


(a) (b) 

Fig. 20 Contribution of various Higgs boson decay modes to the analysis regions in (a) the single-lepton channel and (b) the 
dilepton channel. 



2j, 2b 

3j, 2 b 

3j,3b 

ttH (125) 

1.5± 

0.2 

5.3± 0.5 

2.2±0.3 

tt+ light 

14 000 ±1800 

8100± 880 

96 ± 21 

ti + cc 

270 ± 

170 

600± 320 

76 ± 44 

tt + bb 

150 ± 

87 

260± 130 

120± 65 

Z-bjets 

330 ± 

30 

190± 49 

8.2±3.1 

Single top 

430 ± 

71 

270± 30 

7.6 ±3.5 

Diboson 

6.8± 

2.2 

4.2± 1.5 

< 0.1±0.1 

ti+V 

8.4± 

2.7 

21± 6 

1.9±0.6 

Lepton misID 

21± 

10 

33 ± 17 

0.8 ±0.4 

Total 

15 000 ±1900 

9500 ±1000 

310± 85 

Data 

15 296 


9996 

374 

S/B 

< 0.001 

0.001 

0.006 

5/Vb 

0.012 

0.053 

0.114 



>4j, 2 b 

> 4 j, 3 b 

> 4j, > 4b 

ttH (125) 

15 ± 

1 

8.6± 0.6 

2.7± 0.3 

ti+ light 

4400 ± 

810 

120 ± 31 

1.9± 0.8 

tt + cc 

710 ± 

380 

130 ± 74 

5.0± 3.0 

ti + bb 

290 ± 

150 

200± 100 

31 ± 17 

Z-bjets 

100 ± 

39 

10 ± 4 

0.6± 0.2 

Single top 

140 ± 

55 

11± 5 

0.8± 0.2 

Diboson 

4.0 ± 

1.3 

0.4± 0.1 

< 0.1± 0.1 

ti+V 

45 ± 

14 

7.8 ± 2.4 

1.1± 0.4 

Lepton misID 

38 ± 

19 

4.3 ± 2.2 

0.4± 0.2 

Total 

5800 ±1000 

490± 140 

43 ± 18 

Data 

6006 


561 

46 

S/B 

0.003 

0.015 

0.059 

S/y/B 

0.197 

0.365 

0.401 


Table 7 Dilepton channel: pre-fit event yields for signal, backgrounds and data in each of the analysis regions. The quoted 
uncertainties are the sum in quadrature of the statistical and systematic uncertainties on the yields. 
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(a) (b) (c) (d) 


Fig. 21 Single-lepton channel: comparison of tf+HF (dashed) and ft-|-light (solid) background for the four top-ranked input 
variables in the (5j, 3b) region where the NN is designed to separate these two backgrounds. The plots include (a) 

(b) mjfj" (c) p^^^R^and (d) 




ATLAS Simulation 

i/s = 8 TeV - jQfal background 

Single lepton 

5j 54 b .rtH (m = 125 GeV) 



(a) 


(b) 


(c) 


(d) 


Fig. 22 Single-lepton channel: comparison of ttH signal (dashed) and background (solid) for the four top-ranked input 
variables in the (5j,> 4b) region. The plots include (a) Centrality, (b) HI, (c) Arlp^and (d) 





(a) 


(b) 


(c) 


(d) 


Fig. 23 Single-lepton channel: comparison of ttH signal (dashed) and background (solid) for the four top-ranked input 
variables in the (> 6j, 3b) region. The plots include (a) (b) Centrality, (c) HI, and (d) SSLL. 
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(a) (b) (c) (d) 

Fig. 24 Single-lepton channel: comparison of ttH signal (dashed) and background (solid) for the four top-ranked input 
variables in the (> 6j, > 4b) region. The plots include (a) Dl, (b) Centrality, (c) and (d) HI. 



Fig. 25 Dilepton channel: comparison of ttH signal (dashed) and background (solid) for the four top-ranked input variables 
in the (3j, 3b) region. The plots include (a) , (b) (c) Ht, and (d) HA. 





Fig. 26 Dilepton channel: comparison of ttH signal (dashed) and background (solid) for the four top-ranked input variables 
in the (> 4j, 3b) region. The plots include (a) (b) (c) ” 2 ™'“ and (d) AR^^^ 
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(a) (b) (c) (d) 


Fig. 27 Dilepton channel: comparison of ttH signal (dashed) and background (solid) for the four top-ranked input variables 
in the (> 4j,> 4b) region. The plots include (a) , (b) mg))" (c) and (d) Ziflg)'" 


> 6j, > 4b 



Pre-fit 

tiH (125) ft-1-light tt + cc 

tt ± bb 

Post-fit 

UH (125) ft ± light tt + cc 

tt ± bb 

Luminosity 

±2.8 

±2.8 

±2.8 

±2.8 

±2.6 

±2.6 

±2.6 

±2.6 

Lepton efficiencies 

±1.4 

±1.4 

±1.4 

±1.5 

±1.3 

±1.3 

±1.3 

±1.3 

Jet energy scale 

±6.4 

±13 

±11 

±9.2 

±2.3 

±5.3 

±4.7 

±3.6 

Jet efficiencies 

±1.7 

±5.2 

±2.7 

±2.5 

±0.7 

±2.3 

±1.2 

±1.1 

Jet energy resolution 

±0.1 

±4.4 

±2.5 

±1.6 

±0.1 

±2.3 

±1.3 

±0.8 

6-tagging efficiency 

±9.2 

±5.6 

±5.1 

±9.3 

±5.0 

±3.1 

±2.9 

±5.0 

c-tagging efficiency 

±1.7 

±6.0 

±12 

±2.4 

±1.4 

±5.1 

±10 

±2.1 

(-tagging efficiency 

±1.0 

±19 

±5.2 

±2.1 

±0.6 

±11 

±3.0 

±1.1 

High pt tagging efficiency 

±0.6 

- 

±0.7 

±0.6 

±0.3 

- 

±0.4 

±0.3 

tt: pt reweighting 

- 

±5.4 

±6.1 

- 

- 

±4.7 

±5.4 

- 

ti: parton shower 

- 

±13 

±16 

±11 

- 

±3.6 

±10 

±6.0 

tt-|-HF: normalisation 

- 

- 

±50 

±50 

- 

- 

±28 

±14 

tt-|-HF: modelling 

- 

±11 

±16 

±8.3 

- 

±3.6 

±9.1 

±7.1 

Theoretical cross sections 

- 

±6.3 

±6.3 

±6.3 

- 

±4.1 

±4.1 

±4.1 

UH modelling 

±2.7 

- 

- 

- 

±2.6 

- 

- 

- 

Total 

±12 

±32 

±59 

±54 

±6.9 

±9.2 

±23 

±12 


Table 8 Single lepton channel: normalisation uncertainties (expressed in % ) on signal and main background processes for the 
systematic uncertainties considered, before and after the fit to data in (> 6j, > 4b) region of the single lepton channel. The total 
uncertainty can be different from the sum in quadrature of individual sources due to the anti-correlations between them. 
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> 4j, > 4b 



Pre-fit 

ttH (125) ft-b light ti+cc 

tt ± bb 

Post-fit 

ttH (125) ft ± light tt + cc 

tt ± bb 

Luminosity 

±2.8 

±2.8 

±2.8 

±2.8 

±2.6 

±2.6 

±2.6 

±2.6 

Lepton efficiencies 

±2.5 

±2.5 

±2.5 

±2.5 

±1.8 

±1.8 

±1.8 

±1.8 

Jet energy scale 

±4.5 

±12 

±9.4 

±7.0 

±2.0 

±5.5 

±4.5 

±3.3 

Jet efficiencies 

- 

±5.9 

±1.6 

±0.9 

- 

±2.6 

±0.7 

±0.4 

Jet energy resolution 

±0.1 

±4.5 

±1.1 

- 

±0.1 

±2.3 

±0.6 

- 

6-tagging efficiency 

±10 

±5.5 

±5.4 

±11 

±5.6 

±3.1 

±3.0 

±5.8 

c-tagging efficiency 

±0.5 

- 

±12 

±0.6 

±0.3 

- 

±10 

±0.3 

/-tagging efficiency 

±0.7 

±34 

±7.0 

±1.6 

±0.4 

±21 

±4.2 

±0.9 

High pt tagging efficiency 

- 

- 

±0.6 

- 

- 

- 

±0.3 

- 

tt: pt reweighting 

- 

±5.8 

±6.2 

- 

- 

±5.0 

±5.4 

- 

ti: parton shower 

- 

±14 

±18 

±14 

- 

±4.8 

±11 

±8.1 

ff-bHF: normalisation 

- 

- 

±50 

±50 

- 

- 

±28 

±14 

tt-bHF: modelling 

- 

±11 

±16 

±12 

- 

±3.8 

±10 

±10 

Theoretical cross sections 

- 

±6.3 

±6.3 

±6.2 

- 

±4.1 

±4.1 

±4.1 

tiH modelling 

±1.9 

- 

- 

- 

±1.8 

- 

- 

- 

Total 

±12 

±40 

±59 

±55 

±6.7 

±22 

±22 

±13 


Table 9 Dilepton channel: normalisation uncertainties (expressed in % ) on signal and main background processes for the 
systematic uncertainties considered, before and after the fit to data in (> 4j,> 4b) region of the dilepton channel. The total 
uncertainty can be different from the sum in quadrature of individual sources due to the anti-correlations between them. 



(a) (b) (c) (d) 


Fig. 28 Single-lepton channel: post-fit comparison of data and prediction for the four top-ranked input variables in the 
(5j, 3b) region. The plots include (a) (b) m™,)" (c) pfp'J^'^^and (d) The first and last bins in all 

figures contain the underflow and overflow, respectively. The bottom panel displays the ratio of data to the total prediction. 
An arrow indicates that the point is off-scale. The hashed area represents the uncertainty on the background. 
































38 



4j, 2 b 

4 j, 3 b 

4j,4b 

UH (125) 

48 ± 35 

20 ± 15 

3.0 ±2.2 

ti-\- light 

78 000 ±1600 

6300 ±160 

56 ± 

5 

tt + cc 

6400 ±1800 

850 ±220 

26 ± 

7 

tt + bb 

2500± 490 

970 ±150 

63 ± 

8 

ly-l-jets 

3700 ±1100 

170± 51 

4.0 ±1.2 

Z-bjets 

1100 ± 540 

49 ± 25 

1.1±0.6 

Single top 

4700± 320 

330± 28 

6.8±0.7 

Diboson 

220± 65 

11 ± 4 

0.3±0.1 

ti+V 

120 ± 38 

16± 5 

0.9 ±0.3 

Lepton misID 

1100± 370 

78 ± 26 

2.6±1.0 

Total 

98 000 ± 340 

8800± 82 

160 ± 

6 

Data 

98 049 

8752 

161 




5j, 2 b 

5 j, 3 b 

5j, > 4b 

ttH (125) 

60 ± 44 

34 ± 25 

9.4 ± 

6.9 

tt+ light 

38 000 ±1000 

3600 ±120 

65 ± 

6 

tt + cc 

4800 ± 1200 

930 ± 230 

51 ± 

12 

tt + bb 

2400 ± 360 

1300 ± 180 

150 ± 

20 

TT+jets 

1200 ± 420 

87 ± 31 

4.0 ± 

1.5 

Z-l-jets 

370± 200 

28 ± 16 

1.4± 

0.8 

Single top 

1700 ± 150 

190± 18 

8.2 ± 

0.7 

Diboson 

94 ± 35 

8.0± 3.1 

0.5 ± 

0.2 

ti+V 

140 ± 43 

26 ± 8 

3.2 ± 

1.0 

Lepton mislD 

340 ± no 

44 ± 16 

5.7± 

2.2 

Total 

50 000 ± 220 

6200± 54 

300 ± 

10 

Data 

49 699 

6199 

286 




> 6j, 2 b 

> 6 j, 3 b 

> 6 j, > 4 b 

ttH (125) 

89 ± 65 

57 ± 42 

24 ± 

17 

ti+ light 

19 000 ±700 

2100± 87 

58 ± 

5 

ti + cc 

3700 ±890 

890 ±210 

85 ± 

21 

ti + bb 

2000 ±310 

1400 ±190 

330 ± 

37 

IL-l-jets 

450 ±170 

51 ± 19 

4.4 ± 

1.9 

Z-bjets 

150± 86 

16± 9 

1.2± 

0.7 

Single top 

730± 83 

110± 14 

11± 

2 

Diboson 

45 ± 20 

5.6± 2.6 

0.5 ± 

0.2 

ti+V 

170± 52 

42 ± 13 

8.2 ± 

2.5 

Lepton misID 

120± 41 

14 ± 5 

1.1± 

0.5 

Total 

26 000 ±160 

4600± 55 

520 ± 

18 

Data 

26185 

4701 

516 



Table 10 Single lepton channel: post-fit event yields under the signal-plus-background hypothesis for signal, backgrounds 
and data in each of the analysis regions. The quoted uncertainties are the sum in quadrature of statistical and systematic 
uncertainties on the yields, computed taking into account correlations among nuisance parameters and among processes. 
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2j, 2b 

3j, 2 b 

3j, 3 b 

ttH (125) 

2.4± 1.8 

8.1 ± 5.9 

3.0 ±2.2 

tt+ light 

14 000 ±160 

8300 ±170 

84 ±9.6 

tt + cc 

400 ±110 

700 ±160 

92 ± 22 

ti -b bb 

190± 36 

350 ± 49 

140 ± 19 

Z-bjets 

330± 22 

200 ± 43 

7.3 ±2.4 

Single top 

430± 35 

260± 21 

7.6±1.5 

Diboson 

6.8± 2.1 

4.5 ± 1.4 

< 0.1±0.1 

tt + V 

8.7± 2.7 

21 ± 6 

1.8±0.6 

Lepton misID 

19 ± 10 

30 ± 15 

0.7±0.4 

Total 

15 000 ±120 

9900 ± 82 

340 ± 14 

Data 

15 296 

9996 

374 



> 4j, 

2 b 

> 4 j, 3 b 

> 4j, > 4b 

ttH (125) 

22 ± 

16 

11± 

8 

3.1 ± 

2.3 

tt+ light 

4500 ± 

150 

100 ± 

12 

1.4± 

0.3 

ti ± cc 

740 ± 

170 

140 ± 

30 

4.8 ± 

1.1 

ti ± bb 

370 ± 

59 

230 ± 

31 

30 ± 

4 

Z+jets 

100 ± 

33 

9.5 ± 

3.1 

0.4 ± 

0.2 

Single top 

140 ± 

23 

11± 

2 

0.6 ± 

0.1 

Diboson 

4.2± 

1.3 

0.3± 

0.1 

< 0.1 ± 

0.1 

ti+V 

43 ± 

13 

7.0 ± 

2.1 

0.9 ± 

0.3 

Lepton mislD 

34 ± 

18 

3.5 ± 

1.8 

0.2 ± 

0.1 

Total 

5900 ± 

65 

520 ± 

18 

42 ± 

4 

Data 

6006 


561 


46 



Table 11 Dilepton channel: post-fit event yields under the signal-plus-background hypothesis for signal, backgrounds and data 
in each of the analysis regions. The quoted uncertainties are the sum in quadrature of statistical and systematic uncertainties 
on the yields, computed taking into account correlations among nuisance parameters and among processes. 
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Fig. 29 Single-lepton channel: post-fit comparison of data and prediction for the four top-ranked input variables in the 
(5j,> 4b) region. The plots include (a) Centrality, (b) HI, (c) N^^Sind (d) The first and last bins in all figures 

contain the underfiow and overflow, respectively. The bottom panel displays the ratio of data to the total prediction. An arrow 
indicates that the point is off-scale. The hashed area represents the uncertainty on the background. The dashed line shows ttH 
signal distribution normalised to background yield. The ttH signal yield (solid) is normalised to the fitted /r. 
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Fig. 30 Single-lepton channel: post-fit comparison of data and prediction for the four top-ranked input variables in (> 6j, 3b) 
region. The plots include (a) (b) Centrality, (c) HI, and (d) SSLL. The first and last bins in all figures contain the 

underflow and overflow, respectively. The bottom panel displays the ratio of data to the total prediction. An arrow indicates 
that the point is ofll-scale. The hashed area represents the uncertainty on the background. The dashed line shows ttH signal 
distribution normalised to background yield. The ttH signal yield (solid) is normalised to the fitted fi. 
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Fig. 31 Single-lepton channel: post-fit comparison of data and prediction for the four top-ranked input variables in (> 6j, > 4b) 
region. The plots include (a) Dl, (b) Centrality, (c) and (d) HI. The first and last bins in all figures contain the underflow 

and overflow, respectively. The bottom panel displays the ratio of data to the total prediction. An arrow indicates that the 
point is off-scale. The hashed area represents the uncertainty on the background. The dashed line shows ttH signal distribution 
normalised to background yield. The ttH signal yield (solid) is normalised to the fitted yt. 
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Fig. 32 Dilepton channel: post-fit comparison of data and prediction for the four top-ranked input variables in the (3j, 3b) 
region. The plots include (a) , (b) (c) Ht, and (d) H4. The first and last bins in all figures contain the 

underflow and overflow, respectively. The bottom panel displays the ratio of data to the total prediction. An arrow indicates 
that the point is off-scale. The hashed area represents the uncertainty on the background. The dashed line shows ttH signal 
distribution normalised to background yield. The tiH signal yield (solid) is normalised to the fitted /r. 
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Fig. 33 Dilepton channel: post-fit comparison of data and prediction for the four top-ranked input variables in the (> 4j, 3b) 
region. The plots include (a) Ar/^^^ , (b) (c) and (d) AR'^^^ The first and last bins in all figures 

contain the underflow and overflow, respectively. The bottom panel displays the ratio of data to the total prediction. An arrow 
indicates that the point is off-scale. The hashed area represents the uncertainty on the background. The dashed line shows ttH 
signal distribution normalised to background yield. The ttH signal yield (solid) is normalised to the fitted /r. 
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Fig. 34 Dilepton channel: post-fit comparison of data and prediction for the four top-ranked input variables in the (> 4j, > 4b) 
region. The plots include (a) , (b) (c) and (d) The first and last bins in all figures contain 

the underflow and overflow, respectively. The bottom panel displays the ratio of data to the total prediction. An arrow indicates 
that the point is off-scale. The hashed area represents the uncertainty on the background. The dashed line shows ttH signal 
distribution normalised to background yield. The ttH signal yield (solid) is normalised to the fitted fi. 
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